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This  thesis  deals  primarily  with  the  design  and  analysis  of  algorithms 
for  detection  of  narrow-band  slowly  fading  signals,  embedded  in  noise  of  un¬ 
certain  distribution.  The  signal's  phase  and  amplitude  are  random,  but  are 

random,  but  are  essentially  constant  over  the  observation  interval;  in 
adition,  the  Doppler  frequency  of  the  signal  is  unknown.  This  is  the  most 
common  model  of  target  echoes  received  by  radar  or  sonar  systems. 

Attention  is  focused  on  i.i.d.  noise  samples  whose  marginal  distribution 
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part  of  the  distribution  is  also  unknown.  This  is  an  important  generaliza¬ 
tion  of  the  model.  The  purpose  of  the  derived  detection  algroithms  is  two¬ 
fold:  first,  to  assure  constant  probability  of  false  alarm,  regardless  of 

the  noise  distribution  (DF-CFAR) ;  and  second,  to  achieve  optimal  maximin 
probability  of  detection  (P^)  performance. 

Unavoidably,  the  performance  analysis  is  asymptotic,  but  the  main 
objective  is  in  applications  involving  small  sample  sizes  (n).  Therefore, 
several  different  tests  which  are  asymptotically  equivalent  are  proposed, 
and  then  their  small  sample  behavior  is  compared  by  means  of  Monte-Carlo 
simulations.  In  all  the  algorithms,  the  test  statistic  preserves  the  struc¬ 
ture  of  the  quadrature  matched  filter,  which  is  optimal  for  signal  of  un¬ 
known  phase  in  Gaussian  noise.  However,  in  place  of  the  linear  sample  mean, 
a  minimax  robust  estimator  of  the  random  amplitude  is  substituted.  To 
generate  this  estimate,  scale  invariant  M-  (maximum  likelihood  type),  L- 
(linear  combination  of  order  statistics)  or  R-  (rank)  type  statistics  are 
utilized. 

The  DF-CFAR  property  is  obtained  by  using  additional  noise-reference 
observations,  which  are  available  in  a  matrix  format.  One  of  the  columns 
consists  of  the  samples  taken  from  the  spatial  resolution  cell  which  pre¬ 
sumably  contains  the  target,  and  the  others  are  taken  from  adjacent  cells. 
Several  canonical  structures,  denoted  as  sliding-window  (SW),  one-sample 
(IS),  two-sample  (2S),  and  single-sweep  (SS)  are  studied  (the  last  two  are 
compatible  only  with  R-  type  statistics).  A  new  type  of  IS  R-  test  is  pro¬ 
posed,  which  does  not  use  the  signs  of  the  observations  and  thus  is  CFAR 
even  for  noise  with  asymmetrical  distributions.  (This  test  requires 
bi-phase  modulations  of  the  transmitted  pulse  train.)  The  R-  type  tests  are 
inherently  DF-CFAR,  even  for  finite  n.  The  others  achieve  it  only  asymp¬ 
totically,  but  simulations  have  shown  rapid  convergence. 

In  addition  to  differences  in  complexity  and  small  sample  size  sensi¬ 
tivity,  the  tests  vary  in  their  performance  in  a  non-homogeneous  noise 
(clutter)  environment.  The  IS  is  inherently  immune  to  degradations  caused 
by  spatial  non-stationarity ,  and  the  SW  can  be  protected  in  a  simple 
manner  against  the  important  types.  On  the  other  hand,  only  the  new  SS  test 
for  signal  of  unknown  Doppler  frequency  preserves  the  complexity  pf-the  con¬ 
ventional  periodogram  test.  In  this  test,  each  of  the  complex  observations 
from  the  target *s  cell  are  ranked  with  respect  to  their  simultaneous 
noise  reference  observations,  and  then  an  FFT  is  performed  on  the  vector  of 
robustly  scored  ranks.  In  all  the  other  tests,  the  robust  estimators/sta¬ 
tistics  must  be  generated  for  each  of  the  n  unknown  frequencies.  The  behav¬ 
ior  of  the  tests  in  dependent  noise  situations  is  briefly  discussed,  and 
modifications  are  proposed. 

The  problem  of  robust  digital  M-ary  communication  in  white  non-Gaussian 
noise  is  also  studied.  With  some  simplifying  assumptions,  it  is  possible  to 
cast  it  into  a  framework  similar  to  that  of  robust  estimation  of  location 
parameters.  The  proposed  receivers  preserve  the  structure  of  the  MAP  deci¬ 
sion  rule  for  Gaussian  noise,  but  the  M  correlators  are  replaced  by  limiter- 
correlators  or  by  generalized  M— estimators .  Expressions  for  the.  error  pro¬ 
bability  are  derived  for  both  coherent  and  noncoherent  receivers;  they  are 
shown  to  be  identical  with  those  for  the  AWGN  channel,  but  the  SNR  is  multi¬ 
plied  by  the  same  functional  that  characterizes  M-estimators .  Monotonicity 
of  the  error  probability  with  respect  to  this  effective  SNR  ensures  minimax 
^^p^tormance .  “  .  ,  ■■■  ■  ■■■' 
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ROBUST  AND  NONPARAMETRIC  DETECTION 
OF  FADING  NARROWBAND  SIGNALS 


Moshe  Weiss 

ABSTRACT 

This  thesis  deals  primarily  with  the  design  and  analysis  of  algorithms  for  detection 
of  narrow-band  slowly  fading  signals,  embedded  in  noise  of  uncertain  distribution.  The 
signal’s  phase  and  amplitude  are  random,  but  are  essentially  constant  over  the  observa¬ 
tion  interval;  in  addition,  the  Doppler  frequency  of  the  signal  is  unknown.  This  is  the 
most  common  model  of  target  echoes  received  by  radar  or  sonar  systems. 

Attention  is  focused  on  i.i.d.  noise  samples  whose  marginal  distribution  belongs  to 
an  e-mixture  family,  where  the  scale  (variance)  of  the  “nominal”  part  of  the  distribution 
is  also  unknown.  This  is  an  important  generalization  of  the  model.  The  purpose  of  the 
derived  detection  algorithms  is  twofold:  first,  to  assure  constant  probability  of  false 
alarm,  regardless  of  the  noise  distribution  (DF-CFAR);  and  second,  to  achieve  optimal 
maximin  probability  of  detection  (P^)  performance. 

Unavoidably,  the  performance  analysis  is  asymptotic,  but  the  main  objective  is  in 
applications  involving  small  sample  sizes  (n).  Therefore,  several  different  tests  which  are 
asymptotically  equivalent  are  proposed,  and  then  their  small  sample  behavior  is  com¬ 
pared  by  means  of  Monte-Carlo  simulations.  In  all  the  algorithms,  the  test  statistic 
preserves  the  structure  of  the  quadrature  matched  filter,  which  is  optimal  for  signal  of 
unknown  phase  in  Gaussian  noise.  However,  in  place  of  the  linear  sample  mean,  a 
minimax  robust  estimator  of  the  random  amplitude  is  substituted.  To  generate  this  esti¬ 
mate,  scale  invariant  M-  (maximum  likelihood  type),  L-  (linear  combination  of  order 
statistics)  or  R-  (rank)  type  statistics  are  utilized. 

The  DF-CFAR  property  is  obtained  by  using  additional  noise-reference  observa¬ 
tions,  which  are  available  in  a  matrix  format.  One  of  the  columns  consists  of  the  samples 
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taken  from  the  spatial  resolution  cell  which  presumably  contains  the  target,  and  the  oth¬ 
ers  are  taken  from  adjacent  cells.  Several  canonical  structures,  denoted  as  sliding- window 
(SW),  one-sample  (IS),  two-sample  (2S)  and  single-sweep  (SS)  are  studied  (the  last  two 
are  compatible  only  with  R-  type  statistics).  A  new  type  of  IS  R-  test  is  proposed,  which 
does  not  use  the  signs  of  the  observations  and  thus  is  CFAR  even  for  noise  with  asym¬ 
metrical  distributions.  (This  test  requires  bi-phase  modulations  of  the  transmitted  pulse 
train).  The  R-type  tests  are  inherently  DF-CFAR,  even  for  finite  n  .  The  others  achieve 
it  only  asymptotically,  but  simulations  have  shown  rapid  convergence. 

Asymptotic  maximin  optimal  performance  is  shown  for  all  test  structures,  with 
L-  and  R-  statistics,  and  the  simulations  demonstrate  almost  identical  performance  of 
the  SW-  L-  type  test  (with  a-  trimmed  estimators)  down  to  n  =16.  For  the  other  types, 
robustness  of  the  against  the  distribution  shape  is  maintained,  but  they  are  some¬ 
what  less  sensitive.  The  superiority  of  the  proposed  tests  over  the  traditional  robust 
tests,  which  have  been  based  on  a  limiter-correlator,  is  also  demonstrated. 

In  addition  to  differences  in  complexity  and  small  sample  size  sensitivity,  the  tests 
vary  in  their  performance  in  a  non-homogeneous  noise  (clutter)  environment.  The  IS  is 
inherently  immune  to  degradations  caused  by  spatial  non-stationarity,  and  the  SW  can 
be  protected  in  a  simple  manner  against  the  important  types.  On  the  other  hand,  only 
the  new  SS  test  for  signal  of  unknown  Doppler  frequency  preserves  the  complexity  of  the 
conventional  periodogram  test.  In  this  test,  each  of  the  complex  observations  from  the 
target’s  cell  are  ranked  with  respect  to  their  simultaneous  noise  reference  observations, 
and  then  an  FFT  is  performed  on  the  vector  of  robustly  scored  ranks.  In  all  the  other 
tests,  the  robust  estimators/statistics  must  be  generated  for  each  of  the  n  unknown  fre¬ 
quencies.  The  behavior  of  the  tests  in  dependent  noise  situations  is  briefly  discussed,  and 
modifications  are  proposed. 

The  problem  of  robust  digital  M-ary  communication  in  white  non-Gaussian  noise  is 
also  studied.  With  some  simplifying  assumptions,  it  is  possible  to  cast  it  into  a  frame- 
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work  similar  to  that  of  robust  estimation  of  location  parameters.  The  proposed  receivers 
preserve  the  structure  of  the  MAP  decision  rule  for  Gaussian  noise,  but  the  M  correla¬ 
tors  are  replaced  by  limiter-correlators  or  by  generalized  M-estimators.  Expressions  for 
the  error  probability  are  derived  for  both  coherent  and  noncoherent  receivers;  they  are 
shown  to  be  identical  with  those  for  the  AWGN  channel,  but  the  SNR  is  multiplied  by 
the  same  functional  that  characterizes  M-estimators.  Monotonicity  of  the  error  probabil¬ 
ity  with  respect  to  this  effective  SNR  ensures  minimax  performance. 
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1.  INTRODUCTION 


1.1  Motivation  and  Outline 

The  classical  theory  of  signal  detection  is  based  on  a  complete  statistical  characteri¬ 
zation  of  the  signals  and  interference  that  are  typical  of  the  environment.  In  many  cases, 
especially  for  the  adverse  situations  in  which  radar  and  sonar  systems  operate,  this  pre¬ 
cise  modeling  is  either  complex  or  impossible.  Consequently,  there  is  a  strong  motivation 
to  study  the  design  of  detection  procedures  which  are  insensitive  or  robust  to  deviations 
from  the  assumed  statistical  models. 

Following  the  fundamental  work  of  Huber  on  robust  estimation  of  location  parame¬ 
ters  [l]  and  on  robust  hypotheses  testing  [2],  extensive  applications  and  further  research 
have  appeared  in  the  communication  and  information  sciences  literature,  of  which  refer¬ 
ences  [4]- [12]  are  a  representative  sample.  Inspection  of  these  references  reveals  that 
they  are  mostly  concerned  with  lowpass  deterministic  (i.e.,  unknown  but  non-random) 
signals,  except  [9]  which  treats  robust  detection  of  weak  stochastic  signals,  and  [12] 
where  results  have  recently  been  presented  for  deterministic  narrow-band  signals. 

In  this  thesis,  we  address  the  design  and  analysis  of  robust  detection  algorithms 
which  are  appropriate  for  radar  -  sonar  environments.  By  way  of  contrast  to  the  deter¬ 
ministic  signals  that  have  been  studied  in  the  above  mentioned  references,  we  focus  on 
coherent,  slowly-fading  narrow-band  signals,  which  are  typical  of  radar-sonar  target 
echoes.  Under  this  model,  the  signal’s  phase  and  amplitude  are  random,  but  are  essen¬ 
tially  constant  over  the  observation  period.  Furthermore,  the  frequency  Doppler  shift  of 
the  received  signal  is  unknown  a  priori,  as  it  is  proportional  to  the  target’s  radial  velo¬ 
city.  While  an  extensive  detection-estimation  theory  exists  for  this  model  when  the  noise 
is  Gaussian,  cf.  [13-15],  robust  detectors  have  not  been  investigated.  Although  the  algo¬ 
rithms  developed  in  this  thesis  share  similar  techniques  for  robustification  and  analysis, 
the  results  of  [4-12]  are  not  directly  applicable.  Moreover,  in  previous  studies  it  has  been 
assumed  that  the  ‘‘nominal”  noise  distribution  is  precisely  known.  In  this  thesis,  on  the 
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other  hand,  we  allow  uncertainty  on  its  scale,  i.e,  the  noise  power.  The  scale  is  almost 
always  unknown,  as  the  dominant  noise  sources  are  clutter  or  reverberation  echoes, 
which  vary  strongly  with  the  environmental  conditions.  Consequently,  one  can  argue 
that  scale  uncertainty  is  more  fundamental  than  distribution  shape  uncertainty. 

Scale  invariant  detection  has  been  studied  before  in  the  context  of  constant  false 
alarm  rate  detection  (CFAR);  both  adaptive  and  nonparametric  techniques  have  been 
utilized  [2^,32].  However,  the  main  objective  was  to  assure  the  CFAR  property  and,  usu¬ 
ally,  simultaneous  optimization  of  the  probability  of  detection  [Pj)  had  not  been 
attempted.  Apparently,  scale  invariant  detection  algorithms  which  are  also  insensitive  to 
the  distribution  shape  have  not  been  studied  before.  (Some  tests  based  on  rank  statistics 
are  shown  in  this  thesis  to  be  quite  robust,  but  this  is  a  fortunate  feature  and  is  not 
guaranteed  a  priori). 

Another  feature  of  our  approach  stems  from  the  characteristics  of  radar  -  sonar  sys¬ 
tems.  The  number  of  samples  (n)  is  constrained  by  other  system  requirements  (max¬ 
imum  detection  range,  spatial  resolution,  allocated  time  frame  for  the  detection  sector), 
and  typically  is  between  10  -  50  samples  or  less.  Moreover,  the  desired  levels  of  the  false 
alarm  probability  {Pfa)  are  very  low;  values  of  lO"^  to  10“®  are  typical.  Consequently, 
optimal  algorithms  that  are  the  result  of  an  unavoidable  asymptotic  analysis,  serve  as 
candidates  for  which  comparative  small-sample  simulation  studies  are  performed.  While 
this  approach  is  generally  accepted  in  the  statistical  literature,  only  a  few  of  the  works 
[4-12]  exhibit  small  sample  performance.  Moreover,  the  majority  of  the  previously  dev¬ 
ised  robust  tests  are  based  on  limiter  -correlator  structures,  where  the  test  statistic  is 
derived  from  the  sum  of  memoryless  transformations  on  the  input  observations.  For 
their  asymptotic  maximin  robust  optimality,  the  signal  is  assumed  to  be  weak.  As  might 
be  expected,  simulation  results  in  this  work  demonstrate  that  for  small  n  and  low 
the  limiter-correlator  tests  suffer  considerable  P^  losses;  clearly,  when  the  signal  is 
sufficiently  strong  to  be  detected,  it  gets  limited  by  the  detector. 
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In  this  thesis,  we  look  at  several  different  types  of  tests.  Motivated  by  the  good 
small-sample  behavior  of  robust  estimators  of  a  location  parameter  [35],  the  proposed 
tests  are  based  on  a  statistic  which  is,  essentially,  a  robust  and  scale  invariant  estimator 
of  the  amplitude  of  the  narrow-band  signal.  To  generate  this  estimator,  M-,  L-  or  R- 
type  statistics  [3]  are  used.  (Tests  based  on  M-estimators  were  proposed  by  El-Sawy  & 
VandeLinde  [6],  Rieder  [73],  and  others  for  the  simpler  case  of  deterministic  low-pass  sig¬ 
nals).  Although  these  different  types  can  be  made  asymptotically  equivalent  at  a  fixed 
noise  distribution  by  properly  choosing  their  score  function,  the  resulting  tests  differ  in 
performance  when  the  distribution  is  allowed  to  vary  over  a  “neighborhood”,  and  even 
more  so,  they  differ  in  their  small  sample  performance.  Consequently,  by  looking  at 
different  structures  it  is  possible  to  choose  the  more  appropriate  algorithm  for  a  particu¬ 
lar  application,  in  terms  of  both  performance  and  complexity. 

Scale  invariance  of  the  amplitude  estimator  is  not  sufficient  for  CFAR  operation, 
and  several  approaches  are  studied  in  this  thesis  for  achieving  it.  We  consider  input 
observations  that  are  available  in  a  matrix  format,  where  one  of  the  columns  (of  length 
n )  represents  the  samples  taken  from  the  spatial  resolution  cell  which  presumably  con¬ 
tains  the  target.  The  other  M  columns,  taken  from  adjacent  range  cells  (in  the  radar  or 
active  sonar  case),  are  considered  as  noise  ref  erence  ,  i.e.,  they  are  assumed  to  be  ident¬ 
ically  distributed  as  the  target  cell  is  under  the  null  hypothesis.  It  is  possible  to  utilize 
the  noise  reference  information  in  several  canonical  structures. 

In  “Sliding  Window”  (SW)  tests,  the  same  statistic  is  generated  for  each  of  the 
columns;  its  sample-mean  over  the  reference  window  (which  is  an  estimator  of  the  vari¬ 
ance  of  the  robust  amplitude  estimator)  serves  as  an  adaptive  threshold,  against  which 
the  target  cell  statistic  is  compared.  The  whole  structure  is  then  slid  over  the  entire 
range  coordinate,  and  decisions  are  made  for  each  of  the  cells.  In  this  way,  the  imple¬ 
mentation  complexity  is  basically  that  of  computing  the  single  column  statistic.  All  the 
three  types  of  statistics  (M-,  L-,  R-)  are  compatible  with  a  SW  mechanization. 
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“Two-sample”  tests  (2S)  are  studied  in  relation  to  tests  based  on  the  ranks  of  the 
observations  (R-tests).  In  this  class,  the  decision  is  based  on  the  ranks  of  the  target’s  cell 
observations  with  respect  to  the  entire  reference  window  observations.  In  “single-sample” 
(IS)  tests,  only  the  column  of  the  test  cell  is  utilized;  these  are  useful  when  no  reliable 
assumption  concerning  the  spatial  stationarity  of  the  noise  can  be  made.  This  class  is 
also  compatible  with  all  three  statistic  types.  Finally,  “single-sweep”  (SS)  tests  are  R- 
type,  where  for  each  observation  from  the  test  cell,  the  rank  with  respect  to  the  same 
row  in  the  matrix  is  computed,  and  the  final  statistic  is  accumulated  over  the  n  rows. 
This  is  the  only  test  that  preserves  the  computational  efficiency  of  the  linear  periodo- 
gram  (FFT)  detector,  when  implemented  as  a  bank  of  n  frequency  contiguous  tests  over 
the  Doppler  uncertainty  range. 

The  tests  based  on  R-statistics  enable  distribution  free  (DF)  CFAR  operation,  even 
for  finite  n  .  For  the  other  types  DF-CFAR  is  achieved  only  asymptotically,  but  the 
simulations  have  shown  rapid  convergence.  In  all  combinations  of  the  statistic  type  and 
reference  structure,  it  is  possible  to  design  for  an  exact  or  approximate  maximin  per¬ 
formance:  in  some  class  of  noise  distributions  the  test  is  optimal,  against  a  particular  dis¬ 
tribution,  within  a  specified  class  of  tests;  and  its  is  larger  for  any  other  distribution 
in  the  class.  However,  the  actual  P^  depends  also  on  the  noise- reference  structure.  Thus, 
for  a  particular  application,  and  for  different  assumptions  on  the  spatial  stationarity,  it 
is  possible  to  trade-off  complexity  for  performance  by  choosing  the  appropriate  combina¬ 
tion  of  statistic  type  and  reference  structure. 

As  a  consequence  of  the  asymptotic  analysis  and  simulation  study,  our  tentative 
conclusion  is  that  the  proposed  robust  detectors  emerge  as  quite  adequate  for  success¬ 
fully  treating  most  of  the  important  uncertainties  that  are  encountered  in  the  real-world 
radar-sonar  detection  environment. 
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Outline  Of  The  Thesis 

Chapter  2  introduces  the  problem  and  presents  a  performance  analysis  of  the  com¬ 
mon  (unrobustified)  quadrature  detector  under  situations  of  contamination.  Since  in 
most  applications  even  if  the  noise  distribution  is  known  its  variance  is  usually  not,  the 
adaptive  threshold  version  of  the  detector  is  emphasized.  Here,  the  test  statistic  is  nor¬ 
malized  by  the  Maximum-Likelihood  (ML)  estimate  of  the  noise  level.  While  this  test 
achieves  the  desired  CFAR  property  asymptotically,  it  is  demonstrated  that  for  small 
sample  sizes  can  increase  intolerably  under  contamination,  even  if  the  percentage  of 
the  contaminating  component  (e)  of  a  mixture  distribution  [l]  is  rather  small.  Both 
analytical  approximation  of  the  distribution  (by  means  of  the  first  terms  in  a  Laguerre 
type  expansion)  and  simulation  are  utilized.  In  addition  to  degradation,  a  large 
decrease  in  P^  is  also  possible. 

In  Chapter  3  we  study  a  very  simple  robust  test  on  the  “coherent  envelope”  of  the 
observables,  i.e.,  on  the  output  of  a  quadrature  matched  filter  which  is  the  sufficient 
statistic  under  the  purely  Gaussian  noise  case.  For  this  scalar  statistic,  extension  of 
Huber’s  [2]  maximin  robust  test  is  straightforward.  It  consists  essentially  in  finding  the 
threshold  setting  and  a  randomization  constant  under  the  least  favorable  condition. 
However,  it  is  found  that  the  proposed  test  can  protect  against  contaminations  that 
would  have  roughly  doubled  the  Py^  of  the  unrobustified  quadrature  receiver.  Beyond 
that,  the  maximin  bound  on  P^  decreases  rapidly  as  a  function  of  e.  Hence,  this  test  is 
rather  inappropriate  for  radar/sonar,  where  very  low  Py^  is  required,  and  e  can  be  much 
larger  than  the  desired  Py^ . 

In  the  following  chapters,  the  problem  is  solved  in  stages.  We  first  treat  the  case 
where  only  the  signal’s  phase  is  random;  the  full  problem  with  all  the  previously  men¬ 
tioned  uncertainties  is  addressed  in  chapters  5  and  6. 

In  Chapter  4  an  asymptotically  robust  test,  for  detection  of  narrowband  signals  of 
known  frequency  and  additive  noise  of  uncertain  distribution  with  known  scale,  is 
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proposed  and  analyzed.  The  common  large  sample  assumption  is  an  unavoidable  neces¬ 
sity  in  order  to  get  explicit  functionals  for  the  error  probabilities,  on  which  optimization 
according  to  a  maximin  criterion  becomes  possible.  However,  we  abandon  the  weak  sig¬ 
nal  assumption  that  has  dominated  the  literature.  By  avoiding  this  assumption,  we  hope 
that  the  proposed  test  will  be  adequate  for  small  sample  sizes.  The  proposed  test  statis¬ 
tic  is  derived  from  a  minimax  robust  ML  estimation  of  the  amplitude.  This  is  achieved 
by  applying  Huber’s  [l]  M-estimator  of  “location”  to  both  the  in-phase  and  quadrature 
samples,  and  then  independence  of  the  random  phase  is  obtained  by  summing  the 
squares  of  these  estimates,  exactly  as  in  the  quadrature  matched  filter.  This  test  is 
shown  to  be  maximin  robust  for  sufficiently  large  values  of  the  desired  (in  the  range 
of  practical  interest)  and  for  several  common  models  of  the  target’s  amplitude  distribu¬ 
tion.  Finally,  the  weak  signal  locally  optimal  robust  detector  for  our  problem  is  outlined, 
and  is  shown  to  be  a  local  approximation  to  the  M-estimation/detection  structure 
derived  here. 

In  Chapter  5,  further  uncertainties  that  must  be  considered  in  practical  applica¬ 
tions  are  studied.  With  regard  to  e  in  the  mixture  model,'  it  is  shown  numerically  that 
even  a  very  pessimistic  design  with  e=0.5  incurs  a  rather  small  additional  loss  compared 
to  the  case  when  e  is  known  exactly.  Thus,  exact  knowledge  of  e  is  unnecessary.  More 
important,  the  robust  M-detector  is  extended  to  handle  an  unknown  scale  (power  level) 
of  the  nominal  noise  distribution  in  the  mixture  family,  by  coupling  it  with  a  robust  esti¬ 
mator  of  the  scale,  from  which  an  adaptive  threshold  is  generated.  While  it  is  not  possi¬ 
ble  to  exhibit  maximin  properties  of  the  extended  test  in  a  IS  class  (i.e.,  when  no  refer¬ 
ence  observations  are  available),  it  is  shown  numerically  to  be  qualitatively  robust  in  the 
sense  that  an  upper  bound  on  and  a  lower  bound  on  P^  can  be  guaranteed  within 
various  parametrized  mixture  families.  The  latter  bound  is  quite  close  to  the  Pj 
obtained  by  the  optimal  detector  for  the  nominal  Gaussian  noise. 

When  noise  reference  is  available,  a  similar  estimator-detector  with  an  adaptive 
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threshold,  which  serves  as  an  estimator  of  the  variance  of  the  amplitude  estimator,  is 
shown  to  be  maximin  in  the  class  of  SW  detectors.  Due  to  its  CFAR  property  for  any 
noise  distribution,  the  restriction  for  sufficiently  high  of  Chapter  4  is  relaxed.  More¬ 
over,  the  maximin  performance  is  valid  for  arbitrary  distribution  of  the  signal’s  ampli¬ 
tude,  and  when  the  reference  window  size  M  is  also  large,  the  test  becomes  globally 
maximin  optimal  (i.e.,  without  restriction  to  the  SW  class). 

For  the  important  case  of  nominal  Gaussian  noise,  the  M-estimators  in  either  a  SW 
or  a  IS  test  can  be  replaced  by  a-  trimmed  estimators,  which  are  asymptotically 
equivalent  in  probability  and  simpler  to  compute.  The  same  structure  can  be  generalized 
for  non-Gaussian  nominal  p.d.f.  with  the  appropriate  optimal  robust  L-estimators. 
Finally,  the  signal  frequency  which  is  also  unknown  in  any  realistic  application  (due  to 
Doppler  shifts),  is  treated  by  constructing  a  bank  of  contiguous  robust  tests  which  cov¬ 
ers  the  uncertainty  range.  It  is  shown  that  the  detectability  loss  for  signals  whose  fre¬ 
quencies  straddle  between  adjacent  such  “filters”,  is  asymptotically  identical  with  that 
incurred  by  the  usual  periodogram  (FFT)  detector. 

The  price  that  is  paid  for  the  improved  capability  with  these  new  receiver  struc¬ 
tures  is  a  substantial  increase  in  signal  processing  complexity,  compared  to  the  usual 
linear  FFT  test.  However,  the  most  demanding  nonlinear  processing  task  required  here 
consists  of  several  levels  of  data  rankings,  and  this  operation  is  becoming  possible  even 
for  real-time  radar  applications  with  VLSI  and  VHSIC  technology  [16]. 

The  final  section  of  Chapter  5  presents  a  thorough  Monte-Carlo  study  of  the 
small-sample  performance  of  the  various  robust  detectors.  The  simulation  results  show 
that  the  performance  predicted  by  the  asymptotic  analysis  of  the  SW  robust  detectors  is 
essentially  maintained  even  for  sample  sizes  as  small  as  n=16,  in  guaranteeing  a  high 
lower  bound  on  the  detection  probability  as  well  as  in  controlling  the  false  alarm-level. 
For  the  IS  tests,  somewhat  larger  sample  sizes  are  necessary  for  convergence  to  the 
asymptotic  prediction.  By  way  of  contrast,  the  corresponding  weak-signal  locally 
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optimal  robust  detector  is  shown  to  produce  high  losses  for  small  desired  Pf^  and  large 
deviation  of  the  signal  frequency.  These  losses  are  attributed  to  poor  convergence  proper¬ 
ties  to  the  Gaussian  distribution  as  a  result  of  hard  limiting  on  the  test  statistic,  espe¬ 
cially  when  the  signal  is  sufficiently  large  to  allow  high  detection  probability.  On  the 
other  hand,  our  proposed  tests  limit  the  influence  of  outliers  around  the  arbitrary  signal 
amplitude,  and  they  converge  rapidly  to  the  asymptotic  values. 

Chapter  6  treats  detection  of  narrowband  signals  by  rank  based  statistics.  The 
motivation  is  twofold  -  first,  to  achieve  DF-CFAR  even  with  finite  sample  sizes;  and 
second,  to  improve  the  efficiency  of  the  SW  and  IS  tests  based  on  robust  estimators.  For 
a  noise  distribution  which  is  known  in  shape,  asymptotically  optimal  tests  in  all  three 
generic  classes  (IS,  2S  and  SS)  are  found.  Of  prime  importance  is  the  new  type  in  the  IS 
class.  It  is  shown  that  if  the  polarity  of  the  transmitted  signal  is  changed  in  half  the 
samples,  it  is  possible  to  obtain  with  a  IS  rank  test  optimal  performance,  identical  to 
that  of  the  completely  known  distribution  case,  and  without  utilizing  any  reference  sam¬ 
ples.  Identical  performance  is  possible  with  a  SW  test  only  for  large  M .  Consequently, 
the  IS  R-test  is  inherently  robust  in  a  spatially  non-stationary  environment. 

In  the  SS  class,  a  Doppler  test  that  requires  the  least  amount  of  computations  is 
found.  First,  the  target  cell  observations  are  ranked  with  respect  to  the  same  row,  and 
then  an  FFT  is  performed  on  the  complex  vector  of  the  scores  of  the  ranks.  Thus,  the 
complexity  is  roughly  O(nlogn),  compared  to  n  times  more  in  Doppler  tests  with  SW, 
IS  or  2S  structures.  This  test  is  attractive,  therefore,  in  typical  radar  systems  where 
even  the  FFT  mechanization  requires  rates  on  the  order  of  millions  of  operations  per 
second. 

For  all  the  rank  based  tests,  robustification  against  the  noise  distribution  uncer¬ 
tainty  is  achieved  by  designing  the  score  function  according  to  the  least  favorable  noise 
distribution.  We  show,  however,  .that  the  rank  tests  with  the  traditional  Wilcoxon  or 
Van  Der  Waerden  score  functions  are  inherently  robust.  Nevertheless,  the  robust  score 
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function  does  not  entail  significant  additional  computation  complexity,  and  should  be 
preferred. 

The  chapter  ends  with  a  Monte-Carlo  study  of  the  small  sample  behavior,  followed 
by  a  comparison  between  the  different  robust  tests  of  previous  chapters.  The  issues  of 
compatibility  with  a  non-homogeneous  environment,  and  with  dependent  noise  situations 
are  briefly  discussed.  It  is  not  appropriate  to  declare  one  of  the  tests  as  the  uniformly 
best,  but  it  is  possible  to  choose  the  one  more  suitable  under  specific  environmental  con¬ 
ditions,  and  for  an  accepted  complexity  in  implementation. 

The  problem  of  robust  digital  M-ary  communication  receivers  is  studied  in  Chapter 
7.  With  some  simplifying  assumptions,  it  is  possible  to  cast  the  Bayesian  multiple 
hypothesis  decision  problem  in  a  framework  similar  to  that  of  the  binary  Neyman- 
Pearson  case.  Robustification  against  non-Gaussian  noise  is  achieved  by  replacing  the 
linear  correlators,  in  the  bank  of  correlators  receiver  which  is  optimal  for  Gaussian  noise, 
with  limiter-correlators  or  with  regression  type  M-estimators,  Implicit  expressions  for  the 
error  probability  with  white,  but  non-Gaussian,  noise  are  derived  assuming  discrete  time 
processing.  These  expressions  are  identical  in  form  with  those  for  the  AWGN  channel, 
but  the  eflTective  SNR  is  now  multiplied  by  the  same  functional  of  the  M-estimators 
(both  coherent  and  noncoherent  receivers  are  considered).  Consequently,  choosing 
Huber’s  function  for  the  limiter  nonlinearity  again  results  in  minimax  performance.  It  is 
also  demonstrated  that  the  capacity  of  the  assumed  non-Gaussian  channel  can  be  much 
higher  when  compared  to  the  AWGN  channel  of  equal  noise  and  signal  powers.  There¬ 
fore,  efficient  and  robust  communication  is  possible  with  either  higher  rates  or  reduced 
transmitted  power. 
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1.2  Review  of  Huber’s  Results  for  Maximin  Testing  of  Simple  Hypotheses 

Huber  [2]  considered  and  solved  the  following  problem.  Let  (a;,-  },•=!  be  a  sequence 
of  independent  random  variables,  and  let  be  distinct  probability  measures  on 

the  real  line  with  the  corresponding  densities  {/  o;/ i}  with  respect  to  some  measure. 
Assume  that  the  likelihood  ratio  (LR)  /  1(1)//  o(x}  =  L(x)  almost  surely,  where  L(x)  is 
a  monotone  function. 

Let  M  be  the  set  of  all  probability  measures  on  the  real  line  and  0<e<l  a  given 
number.  The  uncertainty  in  the  distribution  of  the  observations  is  introduced  by 
expanding  the  simple  hypothesis  Pq  and  simple  alternative  into  composite  ones  by 
a  mbcture  model- 


Hq-  Fq  —  {Q  I  Q  — (pr^C Q,  CoGM} 

Hi-  =  {Q  GM  I  Q  =(1— e)F2+eC'|,  C'jGM} 


(1.1a) 

(1.1b) 


Actually,  Huber’s  setup  is  more  general,  it  allows  also  the  following  neighborhoods 
of  the  nominal  model:  total  variation,  Prohorov  distance,  Kolmogorov  distance  and  Levy 
distance.  Moreover,  different  e,-  are  allowed  in  the  //,■ . 


The  problem  is  to  find  the  most  robust  test  in  the  maximin  sense  between  Pg  and 
Pi,  i.e.,  to  find  a  saddle-point  pair  of  test  d*  {x)  GD,  where  D  is  the  class  of  all  decision 
rules,  and  distributions  g,*GM  such  that 


(1.2) 


subject  to 


sup  a{d,q)  =  a{d*  ,qo)  =  ag 

a  GD 

?GPo 


(1.3) 


Here,  j8(d,q)  is  the  power  of  the  test  (detection  probability)  at  density  q: 


/3(d,q)  =  Frob  {d(x)=Hi  I  Hi  is  true  (qEPi)} 


(1.4) 


and  a(d  ,q)  is  the  level  (false  alarm  probability) 


a(d,q)  =  Prob  {d(x)=Hi  I  Hg  is  true  (qGPg)} 


(1.5) 
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The  meaning  of  the  criterion  is  clear-  d*  is  the  best  Neyman-Pearson  (NP)  decision  rule 
for  the  least-favorable  pair  }. 

As  Huber  showed,  the  most  robust  test  is  a  NP  test  on  the  pair 


^0  = 


(l-e)/  o(a:) 

Ilzil 

n 


,  f  li^) 


=  1(1-£)L'/o(^ 


L{x)<L" 

L{x)>L" 

L{x)>L' 

L{x)<L' 


(1.6) 


(1.7) 


The  numbers  and  U'  are  determined  such  that  qQ  and  qi  are  legitimate  density 
functions. 


For  €  sufficiently  small  (a  condition  which  is  equivalent  to  disjointness  of  Pq  and 
Pi)  the  normalizing  equations  have  a  unique  solution  with  0<L'  <L' '<oo.  The  LR 
between  q^  and  is  thus  given  by  “soft- limiting”  the  nominal  LR  to- 


l{x-L',L") 


L  ^  when  L{x)  <  L  ' 

^  L{x)  when  L '  <  L{x)  <L  " 
L  when  L{x)  >  ' 


n 

and  a  maximin  robust  test  is  a  randomized  NP  test  on  T  (x)= 

i  =1 


(1.8) 


r(x) 


Hi 

for 

r(x) 

>t 

Hi  with  probability  c 

for 

r(x) 

=t 

Ho 

for 

r(x) 

<t 

(1.9) 


The  quantities  t  and  c  are  determined  from  the  right  side  equality  in  Eq.(l.3),  Note  that 


in  general  the  test  must  be  randomized  for  arbitrary  aQ  since  T  (x)  takes  its  values  at 


the  limiter  end-points  with  finite  probability. 


It  is  interesting  to  point  to  some  peculiarity  in  the  result.  The  least-favorable  densi¬ 
ties  are  constrained  by  (1. 6-1.7),  and  in  particular  the  contamination  Ci  can  not  be 
related  in  any  other  way  to  Cq.  While  this  is  reasonable  in  a  game  situation  against  an 
intelligent  opponent,  it  seems  unlikely  for  signal  detection  problems  where  the  uncer¬ 
tainty  in  Pi  is  induced  by  that  in  Pq  and  is  not  affected  by  the  presence  or  absence  of  an 
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additive  signal  whose  characteristics  are  assumed  to  be  known.  For  example,  in  a  “loca¬ 
tion  shift”  problem  {HoiXi—rii,  Hi:Xi=ni+a)  we  would  like  Pi  to  be  the  class  of  all 
distributions  that  are  shifted  to  the  right  by  a  from  those  in  Pq,  and  specifically 
qi{x)=qQ{x-a),  which  can  not  also  satisfy  (1.6-1. 7).  Thus,  it  is  suggested  that  a 
better  solution  might  exist  for  this  physical  formulation  of  the  problem;  unfortunately,  it 
is  unknown  The  asymptotic  reformulation  in  Chap.  4  essentially  avoids  this  peculiar¬ 
ity. 

It  should  be  emphasized  that  Huber’s  proof  of  optimum  robustness  relies  heavily  on 
two  assumptions  which  are  not  valid  for  the  problems  that  are  considered  in  this  work; 
a)  Ffg  Hi  are  simple  hypotheses-  Pq  and  Pi  do  not  include  any  unknown  (relevant 
or  nuisance)  parameters.  b)the  observations  are  independent  r.v.’s. 

1.3  Review  and  Critique  of  Robust  Tests  Based  on  Weak  Signal  Assumption 

Often  in  detection  problems,  the  true  value  of  the  signal  amplitude  A  is  small  but 
otherwise  unknown.  In  such  cases  it  is  plausible,  mostly  as  a  theoretical  nicety  which 
permits  an  analytical  (and  usually  quite  simple  for  implementation)  solution,  to  employ 
the  locally  optimum  detector  (LOD)  structure.  This  is  the  detector  structure  which 
maximizes  the  derivative  of  the  power  function  (detection  probability)  at  A=0,  for  a 
given  test  level  (probability  of  false-alarm),  [25].  Under  suitable  regularity  conditions  on 
the  densities,  the  LOD  is  identical  with  the  detector  which  is  obtained  by  taking  the 
leading  term  of  a  series  expansion  for  the  likelihood-ratio  in  powers  of  the  SNR  around 
zero  [26].  It  also  maximizes  Pitman’s  efficacy  [27]  which  is  a  suitable  weak  signal  measure 
of  performance,  when  the  number  of  the  observed  samples  n  — >oo. 

The  weak  signal  local  optimality  criterion  was  first  extended  to  problems  of  robust 
detection  by  Martin  and  Schwartz  [4],  and  has  been  widely  applied  since  then,  cf.  [5],[7|- 
[10],[12].  Instead  of  seeking  a  maximin  relation  on  the  detection  probability  as  in  Eq. 

Private  communication  with  P.J.Huber, 12/1983. 
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(1.2),  it  was  proposed  in  [4]  to  design  for  a  maximin  relation  on  the  slope  of  the  asymp¬ 
totic  power  function  at  A  =0  : 


su^^[d,qQ 
where  ^{d  ,q)  =-^^{d  ,q  \A) 


=  ^{d*,qo)  =  irif  ^{d*,qo)  (1.10) 

^^0  subject  to  the  false- alarm  constraint  of  Eq. 


(1.3).  The  interpretation  is  that  d*  \s  the  LOD  for  the  least-favorable  density  Qq.  This 
criterion  has  resulted  in  all  the  above  mentioned  references  in  a  limiter -correlator 
structure:  the  locally  optimal  non-linearity  of  the  LOD  receiver  is  robustified  against  €  - 
contamination  by  inserting  a  soft  limiter  at  its  output  and  then  correlating  the  non- 
linearly  transformed  observations  with  the  known  signal  sequence.  This  will  be  denoted 
in  the  following  as  LORD  -  LO  Robust  Detector.  . 


The  performance  of  LOD  schemes  is  commonly  evaluated  using  asymptotic  meas¬ 
ures  such  as  the  Asymptotic  Relative  Efficiency  (AJIE),  which  requires  both  the  assump¬ 
tions  of  large  n  (to  satisfy  the  central  limit  theorem)  and  vanishingly  weak  SNR.  In  any 
practical  engineering  application,  the  number  of  samples  must  be  finite  ^  and  thus  the 
input  SNR  must  be  reasonably  large  to  obtain  meaningful  detection  probability.  It  fol¬ 
lows  that  LOD  schemes  might  perform  poorly  in  practical  applications.  Several  exam¬ 
ples  of  peculiar  and  very  slow  convergence  of  the  finite  sample  RE  to  the  ARE,  can  be 
found  in  (28]-[31].  These  references  include  examples  where  the  finite-sample/large-signal 
ranking  of  detector  performance  are  actually  different  from  ARE  prediction.  Another 
new  example  is  given  in  Appendbc  A.  The  large  SNR  performance  of  the  LOD  would  be 
extremely  poor  when  the  test  non-linearity  redescends  or  even  vanishes  except  for  some 
region  around  the  origin,  i.e.  l[x)=0  VI  x  |  .  In  these  cases  we  will  obviously  get 

^  In  radar-sonar  systems,  n  is  directly  related  to  the  total  search  time  of  the  desired  sector,  to  the 
desired  maximum  non-ambiguous  detection  range,  and  to  the  spatial  resolution  of  targets.  More¬ 
over,  the  detectability  of  coherent  signals  is  (asymptotically)  governed  by  the  average  integrated 
SNR=nA^/2a^.  Since  transmitters  are  usually  constrained  by  the  average  rather  than  peak 
power,  it  is  only  the  product  that  matters  and  improved  detectability  can  be  obtained  by  in¬ 
creasing  A  as  well  as  n.  Hence,  in  view  of  the  other  system  design  goals  that  were  mentioned  in  the 
beginning  of  this  note,  n  is  usually  in  the  range  1-100,  and  even  in  1-3  samples  for  very  long  range 
systems,  in  contrast  with  the  common  theoretical  assumptions. 
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lim  /?-+0,  in  contrast  to  the  desired  consistency  of  the  power  function  with  increasing 

SNR.  This  situation  was  actually  obtained  in  [10].  In  principle,  this  undesired  property 
of  LOD  designs  might  be  corrected  by  switching  between  two  detectors,  where  one  of 
them  is  the  LOD  and  the  other  one  is  some  amplitude-consistent  detector.  The  switching 
should  occur  as  a  result  of  a  threshold  crossing  by  an  estimator  of  the  signal  amplitude 
A  .  This  estimator  must  obviously  be  robust  against  deviations  in  the  assumed  noise 
model  to  prevent  incorrect  switching.  While  this  heuristic  proposal  has  not  been 
analyzed  ,  it  does  suggest  that  an  optimal  robust  detector  should  be  based  on  a  robust 
amplitude  estimator,  as  will  be  studied  in  Chap.  4. 

Inherently,  the  LORD  scheme  is  subject  to  the  same  consistency  problem.  In  addi¬ 
tion,  investigation  of  the  previously  mentioned  references  reveals  that  they  all  exhibit 
one  or  more  of  the  following  peculiarities  and  shortcomings: 

i)  The  support  of  the  contaminating  density  c  in  Eq.  (1.1)  is  restricted  to  lie 
exclusively  on  the  exterior  of  some  interval  [-a:  1,2:1]  ,  cf.  [9],  This  might  not  be  a 
severe  limitation  since  the  greatest  deterioration  in  performance  of  the  conventional 
detectors  occurs  for  contaminations  that  are  in  the  far  tails  of  the  nominal  density. 

ii)  The  nominal  density  itself  is  not  a  member  of  the  mixture  family  Pq  in  which  a 
least-favorable  density  is  sought,  but  rather  “close”  to  it  in  some  measure,  cf.  [9], 
[12]. 

iii)  The  test  is  maximin  robust  only  for  Q:>a(e),  which  is  roughly  in  the  range  0.05-0.2, 
cf.  [4], [5],  and  [7].  Hence,  no  robustness  is  guaranteed  for  the  most  important  range 
of  small  a:  10“^-  10“^. 

It  is  clear  that  the  maximin  relation  (1-10)  is  neither  sufficient  nor  necessary  for 
obtaining  the  desired  maximin  solution  in  terms  of  the  detection  probability  itself.  Since 
Eqs.  (1.2)  and  (1.3)  are  contradictory  in  nature  when  A  can  take  any  value  in  some  inter¬ 
val,  the  most  that  can  be  expected  from  the  LORD  approach  is  as  shown  in  Fig.  1.1:  the 
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shaded  region  on  the  0  axis  indicates  the  a's  obtained  for  the  optimal  detector  for  any 
density  in  the  mixture  family;  the  slope  of  any  0{qo)  at  A=0  agrees  with  (1.10)  and 
(1.2)  is  satisfied  for  all  signal  amplitudes  that  are  greater  than  some  critical  value  : 
^  or  equivalently  0^/3^  . 

However,  the  power  function  of  a  LORD  could  as  well  behave  as  shown  in  Fig.  1.2 
where  (1.3)  and  (1.10)  are  satisfied,  but  the  curve  of  /?(^o)  dominates  that  of  y5(^o)  V 

A  >0  and  Vgo  SPq*  (Other  situations  are  also  possible).  Nevertheless,  if  Pq  the 

distance  between  the  curves  in  Fig.  1.2  could  be  made  sufficiently  small,  i.e.- 
\  I3{d*  jqQ)-P{d*  jqQ)  |  <<5(e,ao,A  )— ►O,  the  detector  would  be  practically  robust,  though 
not  in  the  most  general  maximin  sense.  (In  that  case,  actually  becomes  the  most- 
favorable  density  for  the  detection  probability  !) 

Closer  study  of  the  structure  of  all  the  LORD  tests  in  the  above  mentioned  refer¬ 
ences  reveal  that  implicit  in  all  of  them  is  a  “robust  estimator  of  zero  amplitude”  -i.e., 
the  test  is  actually  based  on  an  approximation  of  the  leading  term  in  the  expansion  of  an 
appropriate  version  of  Huber’s  robust  estimator  in  a  power  series  around  zero  amplitude. 
This  observation  establishes  another  motivation  for  the  asymptotically  robust  test  of 
Chap.  4  which  utilizes  robust  estimation  of  the  amplitude  without  any  small  signal 
approximation,  and  thus  essentially  avoids  all  of  the  above  mentioned  problems. 
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Fig.  1.1  Qualitative  power  function  curves  for  .a  desired  maximin  robust  detector 
of  signal  with  unknown  amplitude. 


Fig.  1.2  Qualitative  power  function  curves  for  a  locally-optimal  robust  detector 
which  is  not  maximin  with  respect  to  the  detection  probability. 
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2.  THE  NARROW-BAND  SIGNAL  AND  NOISE  DETECTION 
ENVIRONMENT 


2.1  Models  and  Problem  Statement 

The  problem  to  be  considered  is  that  of  robust  detection  of  a  slowly  fading 
narrow-band  signal  with  unknown  phase  and  amplitude  in  nearly  Gaussian  narrow-band 
noise.  This  signal  model  is  typical  of  most  coherent  radar  targets  [13]  and  also  appears  in 
some  sonar  problems  and  also  in  communication  over  tropospheric  links  [14]. 

Specifically,  the  discrete  signal  plus  noise  samples  that  are  observed  at  the  systems 
input  terminals  are 

z-  —  Asicos{ojti  +  (f>)  +  rii  ji  =  l,...,n  (2.1) 

The  are  narrow-band  noise  samples  around  the  known^  center  frequency  oj  : 

Tii  —  nf.^cosojt  -  Tig^smujt  (2.2) 

are  assumed  to  be  zero  mean,  independent  ^  and  identically  distributed  (i.i.d) 
random  variables  (r.v.),  from  a  nominally  Gaussian  e  -contaminated  mixture  : 

/  =  n  /  o(«c,)/  o(«»,)  (2.3) 

i=l 

where 

/  o(«c;  =  a; )  =  /  o(«».  =  a: )  =  (l-f)  ^  exp(-^)  +  €c{x)  ,  c  GM  (2.3a) 

v27r<T  2cr 

(f>  is  the  unknown  signal  phase  which  is  common  for  all  samples  (coherent  detection) 
and  uniformly  distributed  over  [0,27r].  It  does  not  convey  any  information  on  the  target 
and  is  thus  a  nuisance  parameter  which  has  to  be  averaged  out.  {5^-  is  a  sequence  of 
known,  positive,  amplitude  modulations^  with  The  amplitude  A  which 


The  case  of  unknown  Doppler  shift  will  be  treated  later  by  the  usual  “bank  of  filters” 
approach. 

Notice  that  sample  -to  -sample  independence  either  excludes  the  common  situation  of 
detection  in  correlated  clutter,  or  it  corresponds  to  a  sub-optimal  scheme  where  the  ob¬ 
servables  are  obtained  by  pre-  whitening  the  input  sequence. 

^  Known  phase  modulations  can  be  introduced  without  much  complication. 
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is  also  constant  for  all  the  observed  samples  will  be  considered  in  the  sequel  as  either 
deterministic  but  unknown  or  random  with  some  known  distribution  of  unknown  scale^. 

It  is  well  known,  cf.  [13],  that  in  the  purely  Gaussian  noise  case  (e  =  0),  a  uni¬ 
formly  most  powerful  (UMP)  test,  independent  of  the  amplitude  being  deterministic  or 
random  with  an  arbitrary  distribution,  is  equivalent  to  a  threshold  test  on  the  “coherent 
sample  envelope”  R  (x,y)  : 

^  (x,y)  =  (  E  2:,-  Si  )^  +  (  E  Vi  s.-  f  (2-4) 

i  =  l  i=l 

where  the  in-phase  and  quadrature  samples  are  obtained  by  applying  the  input  to  a  pair 
of  ideal  lowpass  mixers: 

Xi  =  2zi  coscijt  I  ipp  =  Asj  cos(p  +  n^.  (2.5) 

Vi  =  2zism0i)t  I  =  Asi  sin<j)  +  (2.6) 

Since  the  quadrature  components  are  obtained  by  a  reversible  operation,  they  are 
equivalent  statistics  for  the  problem,  which  can  thus  be  reformulated  as: 


H,: 


Xi  = 

'  Vi  =  rie. 


(2.7a) 


Xi  =  n^.  +  Asi  cos^ 
yi  =  -f  A  Si  sincf) 


(2.7b) 


It  is  desired  to  obtain  a  maximin  relation  like  Eq.  (1.2)  for  all  A  >Ac  (recall  the  discus¬ 
sion  in  Section  1.3)  subject  to  the  false-alarm  constraint  Eq.  (1.3). 

Direct  application  of  Huberts  [1]  solution  for  this  problem  is  inappropriate  since  the 
samples  under  Hi  are  actually  dependent  due  to  the  common  random  phase  even 
when  A  is  deterministic  and  known.  However,  a  “naive”  extension  of  Huberts  test  is  pos- 


These  slow  fading  assumptions  correspond  to  common  radar  targets  whose  cross  section 
(RCS)  fluctuates  with  a  correlation  time  much  longer  than  the  “blip”  duration  nT,  T= 
intersample  period,  but  much  shorter  than  the  scan- to-  scan  period.  This  is  a  most  fre¬ 
quent  situation  [13]. 
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sible  if  one  considers  (f>  2iS  a,  non-random  but  unknown  and  adopts  an  “estimation- 
detection”  procedure.  A  set  of  M— t-co  tests  is  performed,  where  the  test  is  matched 
to  <f)j=j  /2nM  and  its  objective  is  a  decision  about  the  presence  of  a  signal 
of  phase  (pj.  Each  of  the  individual  tests  is  then  of  the  correlator -limiter  type  found 
in  [4],  but  the  limiter  break-points  are  now  functions  of  the  s,-  and  also  of  (pj.  Since 
phase  information  is  irrelevant,  the  final  decision  can  be  reached  by  accepting  if  at 
least  one  of  the  individual  tests  has  accepted  it. 

We  have  not  studied  how  many  parallel  tests  have  to  be  performed  to  achieve 
acceptable  performance  with  arbitrary  signal  phases  between  the  adjacent  “filters”.  It  is 
conjectured  that  it  might  be  prohibitive,  especially  when  the  Doppler  shift  is  also  unk¬ 
nown  so  that  a  two-dimensional  “filter-bank”  (phase,  frequency)  has  to  be  constructed. 

In  the  absence  of  any  optimal  procedure  of  design  for  the  finite  sample  problem,  we 
suggest  to  weaken  somewhat  the  requirements.  We  will  do  that  in  Chap.  3  by  assuming 
that  the  observable  is  only  the  envelope  statistic  of  Eq.  (2.4)  rather  than  the  original 
quadrature  samples.  We  can  then  construct  a  most  robust  test  against  the  uncertainty 
in  the  distribution  of  that  scalar  statistic. 

In  Chap.  4,  we  will  extend  the  asymptotic  estimation-detection  approach  of  El- 
Sawy  and  VandeLinde  [6]  to  our  problem,  utilizing  some  of  Huber's  [l]  results  on  robust 
estimation  of  a  location  parameter.  It  is  shown  that  if  A  is  replaced  by  a  robust 
minimax  estimate,  the  resulting  test  statistic  has  a  limited  robust  property. 

2.2  Performance  Degradation  of  the  Conventional  Envelope  Detector. 

In  this  section,  the  degradation  in  performance  of  the  envelope  detector  of  Eq. 
(2.4),  (which  is  the  UMP  detector  for  slow  fading  narrow-band  signals  in  narrow-band 
Gaussian  noise)  will  be  analyzed  when  it  operates  in  a  background  which  comes  from  a 
Huber-Tukey  mixture  family.  The  decrease  in  detection  probability  as  well  as  the 
increase  in  false  alarm  probability  are  of  interest,  where  the  latter  is  of  an  extreme 
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importance  in  an  automated  search  system.  These  results  will  serve  as  a  basis  for  com¬ 
parison  with  the  performance  of  the  robust  detectors  that  are  developed  later  in  this 
work. 

Radar  and  sonar  systems  usually  operate  in  a  highly  non-stationary  and  non- 
homogeneous  noise  background.  Except  for  the  receiver  thermal  noise  which  is  station¬ 
ary  and  Gaussian,  the  statistical  properties  of  interference  that  is  the  outcome  of  exter¬ 
nal  sources,  (such  as  clutter  reflections  from  ground  sea  or  aerial  objects  in  the  radar 
environment,  or  ambient  noise  and  reverberations  in  the  sonar  case  [14],  [22]),  rapidly 
change  in  time  and  space.  Therefore,  even  when  the  Gaussian  assumption  is  adequate, 
the  fixed  threshold  detector  based  on  the  envelope  statistics  of  Eq.  (2.4)  is  of  little  value, 
since  its  actual  false-alarm  rate  will  fluctuate  intolerably  according  to  the  changes  in  the 
background  noise  level.  A  common  adaptive  detector  for  these  situations,  which  is 
invariant  to  the  power  level  of  the  background  noise,  compares  the  envelope  statistics 
from  the  “test-cell”  with  an  adaptive  threshold  derived  from  a  “noise-reference”  chan¬ 
nel.  Specifically,  {x,y}  represent  the  narrow-band  observation  samples  in  the 
hypothesis- testing  cell,  {u,v}  are  the  noise-reference  samples  which  are  assumed  to  have 
the  same  distribution  as  that  of  {  x,y  }  under  Hq.  The  adaptive  test  is: 

d{x,y,u.v)=Hi  f  ^(x,y)>ilT(u,v)  (2.8) 

where 

i?=/2+g2.  (2.9) 

”i=l  ”  1 

1  ^ 

(2.10) 

j=l 

(2.11) 

”t=i  ”j=i 

The  threshold  multiplier  t  is  determined  as  to  achieve  the  desired  false  -alarm  rate. 
(Notice  that  there  are  M  reference  vectors  for  the  test  vector,  all  of  dimension  n). 

In  search  systems  where  a  target  presence  is  to  be  detected  in  some  spatial  sector, 
the  noise  reference  samples  are  easily  obtainable  by  applying  the  same  signal  processing 
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of  the  hypothesis  testing  channel  to  adjacent  resolution  cells  in  range,  Doppler  or  bearing 
coordinates,  c.f.  [l8]-[20].  This  detector  is  known  as  the  Cell-  Averaging  CFAR(CA- 
CFAR),  Mean  Level  Detector(MLD),  or  Sliding  Window  Detector  (SW).  When  the 
number  M  of  reference  samples  grow  asymptotically,  its  performance  approaches  that  of 
the  UMP  detector  for  the  fixed-variance  Gaussian  noise.  Its  structure  is  also  almost 
identical  with  that  of  a  narrow-band  version  which  can  be  derived  from  the  Two-sample 
Student’s-t  test  [21],  which  has  some  optimum  properties  among  unbiased  and  invariant 
tests  for  detection  in  Gaussian  noise  of  unknown  level.  (In  the  extended  t-test,  the  noise 
variance  is  estimated  from  both  the  test  and  reference  samples  in  contrast  to  (2.8). 
However,  the  difference  in  performance  is  quite  small  when  M  is  sufficiently  large,  and 
the  structure  of  Eq.  (2.8)  enables  SW  mechanization  where  R  as  well  as  are 

generated  sequentially  by  the  same  hardware  during  the  search). 

In  the  following  we  assume  that  the  background  interference  has  an  epsilon- 
contamination  mixture  density,  where  the  nominal  as  well  as  the  contaminating  densities 
in  Eq.(2.3)  are  Gaussian:  /  ={l-e)N {0,l)+eN The  performance  of  the  detector  is 
discussed  first  when  the  number  of  samples  n  is  asymptotically  large,  and  secondly  for 
the  finite  sample  case. 

a)  n  -H-oo 

When  n  is  asymptotically  large,  all  the  quantities  I,  Q,  C/y,  and  Vj  of  Eqs.(2.9)- 
(2.11)  are  Gaussian  random  variables.  Hence,  it  is  obvious  that  this  detector  is  asymptot¬ 
ically  nonparametric  under  the  null  hypothesis;  i.e.,  it  is  CFAR  for  any  probability  den¬ 
sity  of  the  input  test  and  reference  samples  for  which  the  central  limit  theorem  holds. 

Under  the  alternative,  for  a  Rayleigh  distributed  slow-fading  signal  the  detection- 
probability  of  this  detector  is  uniquely  determined  by  the  integrated  Signal- to- Noise 
ratio  (c.f.  [18],  [20]):  SNR  —nA^j2a^.  Since  (j^=(l-e)-(-ec ^  ,  it  is  clear  that  even  a  small 
amount  of  high  power(c^)  contamination  can  reduce  the  effective  SNR  by  orders  of  mag¬ 
nitude,  thus  spoiling  detectability  completely. 
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b)  Finite  n 

With  regard  to  the  detection  probability  a  similar  behavior  as  above  is  to  be 
expected,  as  it  is  not  significantly  affected  by  the  tails  of  the  distribution.  However,  it  is 
no  longer  true  that  the  detector  remains  CFAR  for  moderate  and  small  sample  sizes. 
Since  in  practice  radar-sonar  systems  are  designed  for  very  low  false  alarm  probabilities, 
it  is  the  tails  of  the  distribution  of  the  test  statistics  that  counts,  and  even  a  slight  devi¬ 
ation  from  Gaussianity  of  the  input  samples  will  suffice  to  substantially  increase  the  area 
under  the  tails. 

In  Appendix  B,  an  approximation  is  derived  for  the  distribution  of  the  coherent 
envelope  R,  which  approaches  the  desired  one-sided  exponential  distribution  for  large 
n.  ^  The  approximation  is  based  on  an  expansion  of  the  distribution  in  a  series  of  ortho- 
normal  Laguerre  polynomials,  where  its  leading  term  is  (with  proper  normalization  such 
that  E(R)=1)  the  Gamma  density 


where  q;=(1-|-A:  /2n  )  ^  and 
Gauss-Gauss  mixture  by 


f{R)- 


R 


0L-\ 


r(c.) 


R  >0 


k  is  the  kurtosis  of  the  input  samples. 


(2.12) 


This  is  given  for  a 


k 


E{x^) 

E\x^) 


-  3  =  3e(l 


c2-l 


1  -  e  +  €c  ‘ 


(2.13) 


Notice  that  when  e  0  but  ec ^  »  1,  A;  — 3/e  »  1.  Table  2.1  demonstrates 
that  k  can  be  very  far  away  from  zero  for  a  Gauss-Gauss  mixture,  thus  a  is  also  much 
smaller  than  1  leading  to  higher  false-alarm  probability. 


^  Actually,  an  exact  analytic  expression  was  derived  for  the  special  case  of  a  Gauss- 
Gauss  mixture,  but  it  was  found  to  be  very  difficult  to  compute  numerically. 
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— c 

3.162 

10 

31.62 

d.ooi 

0.01 

0.1 

.238/.992 

2.02/.941 

6.06/.841 

24.3/.568 

73.5/.303 

22.37.589 

748.5/.041 

245.4/.115 

26.477.547 

Table  2.1  Kurtosis  (left  entry)  and  a  (right),  Gauss-Gauss  mixture,  ri=16. 

Figures  2.1  thru  2.5  depicts  the  false  alarm  probability  vs.  the  threshold  multiplier 

t,  where  t  is  set  according  to  the  Gaussian  assumption.  In  each  figure,  the  graphs 
marked  “known  sigma”  corresponds  to  the  adaptive  threshold  detector  of  Eq.(2.8)  with 
M  — »-oo;  in  that  case  the  variance  estimation  is  error-free  and  the  detector  is  identical  in 
performance  to  the  UMP  detector  for  the  known  variance  case.  The  graphs  marked  “ 
e— 0”  give  Pf^  when  the  input  is  indeed  Gaussian.  In  all  figures,  n==16  and  M=8.  This 
is  sufficiently  small  for  poor  convergence  to  Gaussianity  of  the  sample  means  in  the  tails, 
but  is  representative  of  the  number  of  samples  actually  employed  in  most  systems.  The 
adaptive-threshold  curve  with  e=0  was  computed  from  Pf^—{l+t  fMy^ ,  c.f.  [18],  [20]. 
The  Laguerre  approximation  was  computed  only  for  the  M  — *-oo  case,  since  for  the  adap¬ 
tive  threshold  test  no  similar  approximation  is  possible.  ^  The  squares  were  obtained 
from  a  Monte-Carlo  simulation;  it  is  observed  from  the  figures  that  the  Laguerre  approx¬ 
imation  is  quite  reasonable  and  agrees  closely  with  the  simulation  results  in  its  region  of 
low  variance  (roughly  down  to  the  reciprocal  of  the  number  of  runs  in  the  simulation). 
The  main  conclusion  from  the  figures  is  that  the  increase  in  is  very  high,  for  desired 
values  of  10“^  and  lower  whenever  a<0.9,  and  that  this  deterioration  is  ordered  in 
correspondence  with  the  deviation  of  k  and  a  from  their  values  at  the  Gaussian  density. 
This  increase  in  Pf^  is  much  more  severe  for  the  adaptive- threshold  detector  with  finite 
number  of  reference  cells;  in  Fig.  2.5,  for  example,  the  threshold  setting  for  10“®  at  the 
Gaussian  produces  about  3T0~^  at  the  contaminated  mixture. 


2  •  •  • 

Since  it  is  not  possible  to  obtain  analytic  expressions  for  the  moments  of  the  ratio  R/W 
of  Eq.  (2.8).  While  it  is  possible  to  derive  a  similar  approximation  to  the  distribution  of 
W  and  then  to  integrate  numerically  Pr{R>tW},  this  approach  was  not  taken. 
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THRESHOLD 


Tig.  2.1  false-alarm  probability  vs.  threshold  multiplier.  Normal-Normal 
mixture,  e=0.001,c=3.  162, k=. 2364, a=. 9926, 10,000  monte-carlo  runs. 
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Hg.  2.2  false-alarm  probability  vs.  threshold  multiplier.  Normal-Normal 
mixture,  c=0.0l.c=3  i62,k=2  025. a=  9405,50.000  monte-carlo  runs 
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Fig.  2.3  false-alarm  probability  vs.  threshold  multiplier.  Normal-Normal 


mixture.  £=0.1, c  =  3  I62.k=6.06.a=  8400.10.000  monte-carlo  runs 
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Fig.  2.4  false-alarm  probability  vs.  threshold  multiplier.  Normal-Normal 


mixture,  t-0  l,c-]0.,k=22  27.a=.5896,: 0.000  monte-carlo  runs 
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Fig.  2.5  faJse-alarm  probability  vs.  threshold  mulUplier.  Normal-Normal 


mixture,  i  —0  01  ,c~ .0. ,k-73.51  .a  — .3033, 10,000  monte-carlo  runs. 
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3.  ROBUST  TEST  ON  THE  SAMPLE  ENVELOPE 

When  non-contaminated  hypotheses  contain  nuisance  (vector)  parameters  with 
known  distribution  they  are  actually  simple  hypotheses,  and  the  optimal  test  is  a 
Neyman-Pearson  test  on  the  likelihood  ratio  of  the  averaged  densities: 

L  (x)  =  i(x  I  e^)\/Ee^[f  o(x  |  ^?o)l  (3.1) 

An  attempt  to  incorporate  averaging  into  the  proof  of  optimality  of  Huber’s  test  fails 

due  to  the  fact  that  the  marginal  density  is  in  general  not  a  product  of  densities  of  the 

n 

components  a:,-:  Eg\f{y.\e)]y^I{^Es\f{xi  |  6')],  unless  n  =  1.  When  n  =  1,  the 

structure  of  Huber’s  test  extends  to  the  case  where  there  are  nuisance  parameters;  the 
only  correction  needed  is  to  replace  L  (a: )  of  Section  1.2  by  r(a:)  =  E  g\f  i(a;  |  6)]/ f  q{x), 
provided  it  is  a  monotone  function  of  x . 

Therefore,  in  the  absence  of  an  optimal  procedure  for  the  design  of  a  robust  test  on 
the  original  observables  {z,-,  y,- },-=i  of  Eq..(2.7),  it  is  proposed  to  weaken  the  require¬ 
ments  and  to  consider  instead  a  most  robust  test  on  some  scalar  statistic.  Since  for 
e  =  0  the  sample  envelope  R  (x,  y)  =  {J]  a:,-  +  (^  y,-  f  is  a,  sufficient  statistic  for  the 

problem,  and  its  distribution  is  independent  of  the  phase,  it  is  natural  to  try  to  robustify 
a  test  based  on  R  . 

Thus  we  will  consider  the  modified  problem: 

Pq  =  )  S  M  I  g  (i?  )  =  (1  -  e')  /  q[R  )  +  q[R  ),  c  q  G  M} 

Pl=  {q{R)eM  \  q{R)  =  {l-e^)  f  ^{R  )  +  c  ^{R  ),  c  ^  €  M} 

where  e'  is  the  amount  of  contamination  on  the  envelope  R. 


If  one  knows  how  to  find  6,  of  the  original  observables  (by  physical  reasoning,  estima¬ 
tion  or  guessing),  he  should  be  able  to  apply  his  procedure  for  finding  ,  Nevertheless,  it 
can  be  verified  that  if  the  uncertainty  in  the  measure  is  given  by  a  mixture  model,  non¬ 
linear  operations  on  random  variables  do  not  change  e  and  summation  of  n  i.i.d.  contam¬ 
inated  r.v’s  results  in  =  i  -  (i  -  ^  from  which  we  get  «  2n  e, . 
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We  now  apply  this  approach  to  Gaussian  contaminated  quadrature  noise  com¬ 
ponents  and  Rayleigh  distributed  signal  amplitude. It  is  well  known  [13]  that  the  nom¬ 
inal  densities  are  given  by: 

f  x{R  \A  )=af  exp[-c7/(R  +n^A^)]Io{A  -/R  jcT^)  ,  R  >0  (3.2) 

/  i(R  )=£[4  [/ i(R  I  A  )]=[o-|(l-f5'„  )]"^exp[-R /orJ(H-5'„  )]  ,  R  >0  (3.3) 

where  a‘^=Ef  a^=2na^,S  =Aq  /2(T^  is  the  input  signal  to  noise  ratio 

under  the  nominal  conditions,  A  q  =E  {A  ^)  and  S„  =nS . 

The  least  favorable  pair  is  given  by  Eqs.(l. 6-1.7),  with  /  i(R )  from  (3.3)  and  /  o{R  ) 
by  the  same  expression  with  S„  =  0,  and 

r(R  )=(l+5„  )"Wp(R'R  )  ,  R>0,  K  AS„/a^{l+S„)  (3.4) 

As  L  (R)  is  monotone,  an  equivalent  test  statistic  is 


T{R)  =  < 


(rjr 


f 


R 


where  the  limiter  breakpoints  are  found 
of  Appendix  C.  These  equations  yield 


R  ^  cr^r' 

,  a^r'  <  R  <  a^r"  (3.5) 

>  R  >  c^r" 

by  solving  the  normalization  equations  (C.2-C.3) 


exp[-r  '(l+5„  )-']+(l+5„  )-^exp[r  'S„  (l+5„  )-i][l-exp(-r  ')]-(l-€')"'  (3.6) 

l-exp(-r  ")+{l+S„  )exp[-r  "  S„  (1+S'„  )-^]exp[-r  "(l+5„  )-i]=(l-e')-i  (3.7) 


It  is  shown  in  Appendix  C  that  the  resulting  maximin  test  can  take  three  different 
forms,  depending  on  e  and  the  desired  a^.  Roughly,  when  e/ao  is  large,  it  is  a  random¬ 
ized  test  where  Hi  is  decided  with  probability  c  <1  if  R  >^.  For  intermediate  values  of 
e/ao,  it  is  a  deterministic  threshold  test  on  R  ,  and  for  small  values,  Hq  is  decided  with 
probability  c,  if  R  .  Equation  (3.7)  is  explicitly  solved  by 


72) - ^ - 

^  ^  This  corresponds  to  a  Swerling  I  target  model  [17],  which  is  a  very  good  description  of 
the  radar  cross  section  (RCS)  fluctuations  for  microwave  frequencies,  where  the  target  size 
is  much  larger  than  the  wavelength  so  that  the  quadrature  components  are  due  to  sum¬ 
mation  of  many  independent  reflectors  and  the  central  limit  theorem  holds  -  [13]. 
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r^'  =ln[5„(l-£')  A'  ]  (3.8) 

while  solutions  for  the  first  one  can  be  found  numerically  from  the  set  of  solutions 

{a  <  1}  of 

S„  a+a  ,  a  =exp[-r' (H-5„  )*^]  (3.9) 

Table  3.1  shows  the  values  of  the  “sufficiently  small”  ej  for  which  Pq  and  are  dis¬ 
joint  and  the  maximin  test  exists  [L'  {(.')<L'  '{e'),  \/e'  <€<,'  ).  These  are  given  as  a 
function  of  the  integrated  signal  to  noise  ratio  S„  ,  the  natural  measure  of  distance 
between  the  hypotheses  for  this  problem. 


10-2 

10-1 

1 

10 

102 

102 

oo 

3.5  •  10-2 

4  •  10-2 

.2 

.42 

.49 

.498 

.5 

Table  3.1  Values  of  critical  contamination  vs.  integrated  nominal 
signal-to-noise  ratio  5„  =  nE {A^)/2a“. 

A  suitable  measure  of  performance  of  a  robust  test  is  how  far  is  the  lower  bound  on 
the  power  l3{d* ,  q*],  from  the  power  of  the  Neyman-Pearson  test  for  e=  0  ,  Pq.  For 

our  problem  Pq  =  aQ^^^  ^  and  using  (3.7)  with  (C.ll)  and  (C.15)  of  Appendix  C, 


/3{d*,  q*) 


1  +  5„ 


.  1-e' 


sj[^-  +  sA 


P  < 


P  > 


Sn  »0 

1  +  -Fn 

■S'nQ'O 
1  +  'S'n 


(3.10) 


The  different  expressions  correspond  to  cases  b)  and  c)  in  the  appendix.  The  last  one  is 
valid  for  those  desired  values  of  ag  when  the  limiter  is  “effective”  and  one  must  resort  to 
a  randomized  test,  which  in  turn  causes  faster  decrease  of  the  power  curve.  Case  a)  of 
the  appendix  is  never  applied  in  this  Rayleigh  signal  example,  as  it  was  found  numeri¬ 
cally  that  it  corresponds  to  false-alarm  probabilities  higher  than  0.5. 


Figures  3.1  -  3.2  depict  P{d*  ,q  *)  versus  P  for  different  values  of  P  and  5„  .  All  the 
graphs  are  clearly  characterized  by  a  sharp  “knee”  at  P^aQS^  /(l+5'„ )  ag,  such  that 
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when  e'  is  larger  than  this  critical  value  the  power  of  the  test  deteriorates  rapidly. 
Notice  that  even  an  increase  of  orders  of  magnitude  in  the  effective  signal  to  noise  ratio 
S„  does  not  help  to  alleviate  the  problem.  The  figures  also  show  the  false  alarm  proba¬ 
bility  of  the  optimal  detector  for  the  uncontaminated  case  ,  when  it  actually  operates  in 
a  worst-case  contamination  under  Hq:  a{N.P.  ,w.c.  )=aQ{l-e')+e'.  (Under  the  worst 
case  power  is  (l-€')/^o  which  is  insignificantly  lower  than  /3q  for  the  small  e's  considered). 

Thus  we  conclude  that  the  proposed  test  can  protect  against  contaminations  which 
would  have  roughly  doubled  the  false  alarm  probability  of  the  Neyman-Pearson  test 
(e';^2Q;o).  Beyond  that,  this  protection  is  achieved  at  an  intolerable  price  of  decrease  of 
the  power.  In  radar  applications,  Oq  is  very  small  and  its  variation  is  significant  only  for 
an  order  of  magnitude  changes.  Thus  the  test  proposed  in  this  section  is  not  very  useful 
in  many  practical  situations. 

A  somewhat  similar  robust  detector  for  a  Rayleigh  signal  in  nearly  Gaussian  noise 
was  recently  studied  by  Shin  and  Kassam  [11].  Motivated  by  insight  gained  from  the 
structure  of  various  robust  detectors,  they  suggested  inserting  a  limiter-squarer  non¬ 
linearity  at  the  outputs  of  the  in-phase  d quadrature  Q  channels, 

summing  the  outputs  and  comparing  to  a  threshold.  The  amount  of  limiting  was  optim¬ 
ized  numerically  and  the  performance  was  analyzed.  Comparing  Fig.  6  of  [11]  with  Figs. 
(3.l)-(3.2)  here,  it  is  observed  that  the  scheme  of  [11]  is  somewhat  better.  However,  the 
numerical  analysis  of  [11]  was  carried  out  only  dox  the  range  e  <  Oq,  and  it  is  not  known 
whether  the  performance  degrades  further  when  e  is  orders  of  magnitude  higher  than 

At  the  least,  this  comparison  demonstrates  that  it  is  advisable  to  “push  back  the 
limiting”  (or  more  precisely,  an  adequate  censoring  of  the  outliers  as  will  be  clear  from 
Chapter  4)  as  far  as  possible  from  the  threshold  comparison  point  to  the  original  observ¬ 
ables. 

Private  communication  with  the  authors. 
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Figure  3.1  a)  False  alarm  probability  that  would  have  resulted  when  worst-case  con¬ 
tamination  is  applied  to  the  UMP  test  for  purely  Gaussian  noise,  vs.  e!  b)  Maximin 
bound  on  the  detection  probability  vs.  e\  ag  =  10“^ 
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Figure  3.2  a)  False  alarm  probability  that  would  have  resulted  when  worst-case  con¬ 
tamination  is  applied  to  the  UMP  test  for  purely  Gaussian  noise,  vs.  el  b)  Maximin 
bound  on  the  detection  probability  vs.  el  Oq  =  10“^ 


-  33  - 


4.  ASYMPTOTICALLY  ROBUST  TEST  FOR  KNOWN  NOMINAL  SCALE 

4.1  Introduction 

In  this  chapter  a  test  will  be  derived  which  is  asymptotically  (n  — >  oo)  most  robust 
in  a  maximin  sense.  The  scale  of  the  nominal  distribution  is  assumed  known  and  refer¬ 
ence  samples  are  not  utilized.  In  the  following  chapter,  we  will  show  by  means  of  simula¬ 
tion  that  this  test  maintains  its  performance  even  for  small  sample  sizes. 

Utilizing  some  of  Huber’s  [1]  results  on  robust  estimation  of  a  location  parameter, 
El-Sawy  and  VandeLinde  [6],  Rieder  [73],  Kassam  et  al.  [10]  and  others  derived  asymp¬ 
totically  robust  tests  for  the  problem  of  detecting  a  completely  known  signal  in  additive 
noise  of  uncertain  distribution.  A  similar  approach  is  possible  (and  turns  out  to  be 
optimal)  for  the  problem  formulated  in  Section  2.1.  We  observe  the  similarity  between 
the  narrowband  slowly  fading  coherent  signal  and  the  lowpass  deterministic  signal:  for  a 
given  received  sample  {x,  y},  A  cos(j>  and  A  sincj)  are  essentially  unknown  location 
parameters  for  the  quadrature  signal  components.  Moreover,  in  purely  Gaussian  noise 
the  unknown  phase  (which  is  a  nuisance  parameter)  is  “averaged  out”  in  the  UMP  test 
statistic  R  =  /^  +  where  I  and  Q  are  the  sample-means  of  the  in-phase  and  qua¬ 
drature  components  of  the  narrowband  observations.  I  and  Q  are  the  Maximum- 
Likelihood  estimates  of  the  component  locations  and  thus  R  is  a  good  estimate  of  the 
amplitude  (squared)  which  is  the  “true”  parameter  of  the  problem,  i.e.,  the  one  that  dis¬ 
tinguishes  between  Hq  and  Hi. 

Building  further  on  insight  gained  from  the  UMP  statistic  R  ,  we  observe  that  in 
the  purely  Gaussian  case  the  detection  probability  E ^Prob  {R  >t  \  A  ,(j>}  is 

analytically  tractable  as  a  result  of  the  unbiasedness  and  Gaussianity  of  I  and  Q  under 
both  hypotheses.  Since  in  order  to  exhibit  a  maximin  property  for  a  composite  testing 
problem,  it  is  necessary  to  obtain  analytical  expressions  for  the  error  probabilities  func¬ 
tionals,  it  appears  natural  to  preserve  the  structure  of  the  R  test,  but  to  replace  the 
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sample-mean  estimates  with  robust  estimates  of  the  locations  {A  cos<l>j  A  sin^},  which 
will  have  the  following  properties.  The  estimates  should  be: 

i)  Shift  invariant  (or  unbiased)  for  all  n  . 

ii)  Efficient  and  consistent. 

iii)  Conditioned  on  A  and  (j)^  asymptotically  Gaussian. 

iv)  Possess  the  minimax  property  on  the  estimation  variance  (which  is  translated 

into  the  effective  SNR  when  iii  is  valid). 

v)  Maintain  the  above  properties  for  medium  and  even  small  n  . 

Huber’s  M-estimator  [1],  and  the  a-trimmed  mean  estimator  (when  the  nominal 
p.d.f.  is  Gaussian)  have  all  these  properties.  This  facilitates  the  proof  of  maximin 
optimality  of  the  proposed  test  when  the  amplitude  is  non-random.  For  random  ampli¬ 
tudes  which  are  of  more  interest  in  practical  applications,  the  expectation  of  the  detec¬ 
tion  probability  over  the  amplitude  must  be  taken.  It  turns  out  that  for  the  most  fre¬ 
quently  used  target  model  in  radar  applications  (Rayleigh  amplitude  or  Swerling  case  I), 
the  maximin  property  of  the  proposed  test  is  only  “almost”  preserved  (recall  Fig.  1.2), 
but  it  is  maintained  exactly  for  a  higher  order  chi-squared  amplitude  model.  (Rayleigh  is 
first  order  while  constant  amplitude  is  the  limit  of  the  chi-square  family  when  the 
number  of  degrees  of  freedom  tends  to  infinity). 

For  the  sake  of  continuity  and  convenience  in  the  exposition,  we  will  begin  with  the 
non-random  amplitude  case,  and  then  move  to  the  more  realistic  random  amplitude 
models.  Throughout  this  chapter  the  amount  of  contamination  e  in  the  Huber-Tukey 
model,  as  well  as  the  variance  of  the  nominal  p.d.f.,  are  assumed  to  be  known.  These 
strong  assumptions  which  are  inherent  in  previous  work  on  robust  detection  [4]-[l2],  but 
are  rarely  satisfied  in  practice,  will  be  relaxed  in  Chapter  5. 
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4.2  Non-random  Signal  Amplitude 

Before  presenting  the  test,  we  need  some  definitions.  Define  a  class  of  functions  ^ 
on  R  ^  such  that  dL  (a; )/ dx  =l  (2)6'^  if: 

1)  L  is  convex,  symmetric  about  the  origin  and  increasing  for  positive  argument. 

2)  /  is  continuous. 


3)  for  all  9  e  Pq,  0  <  [^^(2:)]  <  00. 


(4.1) 


4)  for  all  g  6  P 


0,  E.  [I  (a;  -  0)]  exists  and  is  nonzero  at  ^  =  0. 

Ou 


Define  the  estimates  by  the  following  implicit  equations 


{T,Sil{xi-SiA^)=0}  ,  Ay{l)=Arg{f]sil{yi-SiAy)=0}  (4.2) 


t=l 


(Ax  and  Ay  are  the  robust  M-type  estimates  [l]  for  the  location  parameters  A  cos(f>  and 
A  sin(;ii  of  the  observables  in  Eq.(2.7b),  for  a  given  realization  of  the  random  variables  A 
and  <f).)  Define: 


I'„((o)=[i.('o)P+|.4,(;„)p  (4.3) 

where 


^0(2:)  = -d  [log  go  (a:)]/da:  (4.4) 

and  ^0  €  Pq  minimizes  Fisher’s  information: 

H9o)  <  Hi)  =  J  [-^  q{x)]^  q{x)dx  ,  V  g  6  Pq  (4.5) 

Consider  a  threshold  test  on  r„(/o) 

(  ,  T„{lo)  >  t„{lo)  =  t{lo)/n 

dn(x,  y)  I  ^  otherwise  (4-6) 

and  let  T„{1)  be  any  other  test  based  on  Ax(/)  and  Ay(/)  with  I  G ’1'  but 
L  -log  go-  The  following  proposition  states  the  asymptotic  maximin  properties  of 

the  test  d*,  which  we  will  subsequently  refer  to  as  the  SSQME  (Sum  of  SQuared  M-type 
Estimates)  test. 
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Proposition  4.1  Consider  the  detection  problem  defined  on  the  observables  of  Eq.  (2.7) 
with  the  uncertainty  in  the  independent  quadrature  noise  samples  obeying  an  e-  mixture 
model  Pq  with  symmetrical  p.d.f.  When  A  is  nonrandom  and  <j>  is  uniformly  distributed 
on  [0,  27r],  among  the  class  of  all  tests  (/)  based  on  Aj^l)  and  Ay{l)  of  Eq.  (4.2)  with 
I  there  exists  a  least-favorable  p.d.f.  go  €  Pq  and  a  constant  0  <  <1  such 

that  the  SSQME  test  defined  in  (4.6)  with  n  — +  co  achieves: 

sup /?(/,  go)  = /5(/o,  9o  )  ==  inf  q)  (4.7) 

*  ^  €  Pq 

whenever  g^ )  >  n.nd  subject  to 


sup  a{l ,  q)  =  a{lo,  q^)  =  (4.8) 

?  e  Po 

Moreover,  when  the  nominal  p.d.f.  in  Pq  is  Gaussian,  the  nonlinearity  /q  is  a  sym¬ 
metric  soft  limiter  whose  breakpoints  depend  on  the  noise  variance  but  not  on  the  signal 
amplitude: 


-K/(r,  K/(j)  = 


X  /a^ 

-K{e)/<r 


X  >  K{e)(j 

-K{e)(T  <  X  <  K{e)(j  (4.9) 
a:  <  ~K{e)(r 


with  K  (e)  and  e  related  by 


/  o{K{e))/K{e)  -  (e))  =  e/2(l  -  e) 


(4.10) 


and  /  o(a:)  =  d^{x)/dx  is  the  standard  normal  p.d.f. 


Comments 

1)  Note  that  the  theorem  shows  the  saddle-point  pair  of  p.d.f.  and  test  only  among  a 
certain  class  T  [I ).  We  would  like  the  left  relation  in  (4.7)  to  be  valid  for  any 
hypotheses  test  on  the  original  observables  {x,  y}.  Unfortunately,  this  could  not 
be  proved,  unlike  the  case  of  a  deterministic  signal  in  [6],  as  will  be  clear  in  the 
sequel.  However,  the  class  r(/)  is  quite  large.  As  limiting  cases  it  contains  the 
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sample  means  of  x  and  y  when  e  =  the  sample  medians  (e  =  1)  and  with 
weak  regularity  conditions  which  usually  holds  [l]  and'^y  can  be  any  transla- 

tion  invariant  statistics,  such  as  provided  {^{i)  are  the 

i=l  i=l 

rank  ordered  samples  <  xjg)  <  •  •  •  <  ),  and  similarly  for  Ay. 

2)  The  theorem  remains  valid  (but  /q  given  by  a  different  expression  than  (4.9))  for 
any  other  description  of  the  uncertainty  in  the  noise,  provided  the  asymptotic  vari¬ 
ance  of  the  M-estimates  (4.2)  computed  with  /q  is  the  maximum  achievable  over  the 
class  of  densities  in  Pg.  An  example  is  the  P  family  [6].  Others  can  be  found  in 

[1],  [3]. 

3)  In  contrast  to  Huber’s  finite  sample  size  test,  this  asymptotic  test  satisfies  a  more 
desired  and  natural  optimality  criterion.  Namely,  the  maximin  relation  for  the 
power  Eq.  (4.7)  is  specified  in  terms  of  the  uncertainty  in  Pq  alone,  allowing  the 
uncertainty  in  Pj  to  be  induced  by  that  in  Pq  as  a  shift  of  the  densities  of  the  qua¬ 
drature  samples  (recall  the  discussion  in  Section  1.2), 

4)  Extension  to  unknown  frequency  is  straightforward  by  constructing  a  parallel  bank 
of  SSQME  tests,  the  best  being  matched  to  =  i  jnT  where  T  is  the  sam¬ 
pling  period.  This  will  be  discussed  in  more  detail  in  Section  5.3.  The  individual 
test  is  shown  pictorially  in  Fig.  4.1.  Note  that  a  routine  for  solving  for  a  zero  of  a 
function  is  needed. Thus,  a  real  time  implementation  in  a  radar  or  sonar  system 
calls  for  substantially  larger  number  of  calculations  than  encountered  in  the  more 
common  detection  systems  that  incorporate  linear  filtering,  FFT  and  at  most,  pass¬ 
ing  the  samples  through  a  nonlinearity. 


Thus  the  SSQME  test  is  better  (at  least  asymptotically)  than  the  “scalar”  envelope  test  which 
was  derived  in  the  preceding  chapter. 

^  ^  This  is  true  except  for  the  case  where  the  nominal  p.di.  of  the  uncertainty  family  is  Gaussian, 
where  a  considerable  simplification  is  possible  as  will  be  shown  later. 
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Figure  4.1-  Block-diagram  structure  of  SSQME  test  for  robust 
detection  of  slow-fading  narrowband  signal. 


Proof  of  proposition  4.1  We  need  to  study  the  asymptotic  distribution  of 
T{l),l  €'1'.  It  can  be  derived  using  the  following  lemmas  from  [l],  with  a  slight 
modification  for  time  varying  signal  {s,- }  from  [6]. 

Lemma  1  Whenever  /  €  ^  and  q  is  any  symmetric  p.d.f.,  conditioned  on  A  and  (p, 
'/ri  {A  ,.{l)-Ax)  and  \/n^  {Ay{l)-Ay  )  are  asymptotically  distributed  as  normal  r.v.’s  with 
zero  mean  and  variance  V’(/,  9) 


V{1.  q) 


c 


A  lim  i-  E 


00  Tl 


(4.11) 


where  the  parameters  )  are  the  true  values  of  the  locations  (0  under  Hq, 

=A  cos<f>  and  Ay  =A  sin^  under  Hi). 
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Lemma  2  Let  q  e  Pq  be  an  e-con taminated  mixture  p.d.f.  where  the  nominal  p.d.f.  /  q 
is  symmetric  and  twice  continuously  differentiable,  such  that  -log  f  q  is  convex  on  the 
convex  support  of  F q.  Then,  there  exists  Qq  E  Pq  which  minimizes  Fisher’s  information 
(4.5)  over  Pq  and  an  /q  6  ^  given  by  (4.4),  and  the  asymptotic  variance  of  the  M- 
estimates  satisfies  a  saddle-point  relation 

sup  V(lo,  q)  =  V(lo,  q^)  =  ^  inf  V[l,q*)  (4.12) 

Specifically,  is  given  by 


go(a;)  =  (1  -  e)- 


-  /  X  iC(a;  -  In)  ^ 

f  o{xo)e  X  <  Xq 

/  o(^)  aro  <  a;  <  2:1 


(4.13) 


and  Xi  are  the  endpoints  of  the  interval  where  |  /oW/ol  and  K  is 

related  to  e  through 


/  f  ({x)dx  -h 
^0 


f  0(^0)  +  /  0(2^1) 
K 


1 

1  -  e 


(4.14) 


We  note  that  the  right  inequality  in  (4.12)  is  a  consequence  of  A^{Iq)  and  aly(/o) 
being  the  maximum  likelihood  (ML)  estimates  of  the  corresponding  location  parameters, 
when  the  underlying  p.d.f.  is  q^,  under  both  hypotheses.  In  addition,  they  are  efficient 
under  the  assumed  conditions  on  i.e.,  they  achieve  the  Cramer-Rao  lower  bound 
/“^(go)-  The  left  inequality  follows  from  the  fact  that  gg  minimizes  /(g)  over  Pg. 
Relaxation  of  the  symmetry  requirement  and  other  noise  uncertainty  models  are  treated 
in  [1],  [3]  and  [6]. 


From  Lemma  1  and  the  mutual  independence  of  x  and  y,  A^{1)  and  Ay{l)  are 
independent  and  jointly  Gaussian  when  cf>  is  fixed.  Averaging  over  (f>  is  straightforward 
with  transformation  to  polar  coordinates  and  yields 
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/  (i  ),Ay{l)\A)=E^[f  {A  ,(/  ),  i  y(/  )  I  >1  ,  0)/  (^)] 


(4.15) 


n 

Tin  +  A^  ' 

r 

2tV{1,  q) 

(M 

1 

_ 1 

^0 

[Vil,q)/n_ 

,  T{1)  ^ 


where  Iq{-)  is  the  zeroth  order  modified  Bessel  function.  Also, 


/  {T{1)  I  A  )  =  TT/  {A  An,  A  An  I  A  )  (4.16) 

Now,  restricting  ourselves  to  the  class  of  tests  based  on  Ax(/)  and  Ay(/)  as  the 
new  observables  of  the  problem,  the  likelihood-ratio  is  A(A  ^(1 ),  A  y(/ )  |  A  )= 
exp(-nA  ^/2 F)  /o[nAr^^^(/ )/F]  which  is  monotone  increasing  in  T{1)  for  any  A  ,  deter¬ 
ministic  or  random.  Therefore,  in  view  of  the  Neyman-Pearson  Lemma  T  (/ )  is  a 
sufficient  statistic  and  a  threshold  test  based  on  it  is  UMP  for  iJp  A  >0  vs. 

Hq.  a  =  0. 

Unlike  the  known  lowpass  signal  case  that  was  treated  in  [6],  it  can  not  be  proved 
here  that  T  {1)  is  a  sufficient  statistic  of  the  original  observables  {x,y}.  There,  where 
the  additive  signal  was  non-random,  use  was  made  of  theorems  by  Wald  [23]  and 
Chernoff  [24]  that  a  test  based  on  the  ML  estimate  6  (i.e.,  solution  to  X)U^t~^)=0  with 
l=[-logf  )'  e^)  is  asymptotically  equivalent  to  a  NP  test.  In  our  problem,  T{Iq)  is  not 
the  ML  estimate  of  A^  when  the  underlying  quadrature  noise  p.d.f.’s  are  . 

O 

However,  its  expected  value  is  £’(r(/))  =  A  ^  H - V  [I ,  q).  Thus,  it  is  asymptotically 

n 

unbiased  and  the  variance  of  T{Iq)  has  a  minimax  property  by  virtue  of  Eq,(4.12). 
Indeed,  it  can  be  shown  that 


Var{n^/^T{n)  =  4A^V{l,  q){l  +  ^  V{1,  q)]  ^  4A^V{l,q) 

^  n  » 

Hence,  is  a  good  robust  estimate  of  A  ,  and  as  was  discussed  before,  the  class 

The  MLE  of  A  is  given  by  solving 

n 

d  log  E^l  n  5o  (x,--  A  cos  0)  qo  (j/,-  -  A  sin  <?i)]/rfA  =  0 
1=1 

which  is  analytically  intractable  unless  is  identically  Gaussian. 
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{T {1)}  is  quite  large. 


Having  shown  that  in  the  restricted  class  of  tests  the  optimal  procedure  must  be  a 
threshold  test  on  the  envelope  T{1),  it  remains  to  validate  Eq.(4.7)  subject  to  (4.8). 
Using  (4.16)  with  the  definition  of  Marcum’s  Q-function  Q{a,b)= 


/a:exp[— ^(x^  +  a^)]  Iq{  ax)dx ,  and  recalling  that  when  I  7^  Iq  we  are  free  to  adjust  the 

h  ^ 


threshold  t{l)  in  (4.6)  such  that  a(/,  while  for  the  pair  (/q,  q)  the  threshold 

remains  ^(/o)?  fhe  various  false-alarm  and  detection  probabilities  are  as  summarized  in 
Table  (4.1). 


Q;(/,  q) 

l3{l,q)=Q[a  ==A^nlV{l,  q),  b  =  g  )] 

(1,  ?o) 

exp[-t{l)/2V{[,  qo)] 

a  =  A  ^Jn  /V{1 ,  q^),  b  =  b^ 

(^Oi  <lo  ] 

exp[-t{lo)/2V{lo,  go)] 

a  =  A  yjn  /V[Iq,  q^)  ^  Qq,  b  =  y/-2  log  Oq  A  b  q 

(/o,  q) 

exp[-i(/o)/2F(/o,  g)] 

a  =  A  y/n  /V[Iqj  g  )  ==  Uqa/  ^{^Oj  Qq)/^{^07  q) 

b  =ViUo}/V{lo,  q)^b,y/v{lo,  q*o)/V[l^,  q) 

Table  4.1-  False  alarm  and  detection  probabilities  for  various  test-p.d.f.  pairs. 


Note  that  the  parameter  a^/2  defined  by 

,2 


2  ^  ~  2V[l,  q) 


(4.16a) 


is  the  “effective  integrated  SNR,”  by  analogy  with  the  UMP  detector  for  the  purely 
Gaussian  noise  where  the  integrated  SNR  is  given  by  the  same  expression  with  V  , 

From  the  first  and  second  row  of  the  table,  by  monotonicity  of  the  Q-function  with 
respect  to  the  first  argument  (a)  when  the  second  is  fixed,  and  since  from  Lemma  2 
^inf^  V{1,  qo)=  V{Iq,  q^),  we  get; 
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su^  I3[l,  qo)  =  qo) 

This  proves  the  left  hand  side  of  (4.7). 

From  the  first  column  of  the  second  and  third  row  of  the  table,  since  the  exponent 

function  is  monotonic  increasing  when  V  increases,  by  virtue  of  the  other  inequality  in 

Lemma  2,  we  get  sup  a(/o,  q)  =  a{lQ,  Qq).  It  remains  to  find  under  what  conditions 
?  ePo 


/?(^o.  q)  =Q{aoC  ,boG)  >  P{lo,  qo),  where  C  {q ,  qo)=  ?o)/^(^o>  q)  >  1  for 

all  ^  €  pQ*  Note  that  the  Q  function  is  monotonic  in  either  of  its  arguments  only  when 
the  second  is  fixed,  but  here  they  both  change  proportionally  and  the  inequality  is 
analytically  intractable.  A  sufficient  condition  is  found  by  solving  for  Uq  with  a  fixed 
those  values  which  satisfy  d/3{lo,  q)/dC  >  0,  (7  >  1.  Using  known  properties  of  the 
partial  derivatives  [15,  Appendix  F]: 


)  =y  exp[-f-±.^.  ]/^(xy )  ,  i£.^iyl==-yexp[-^^^]Io{xy)  (4.17) 


dx  "  ‘  ‘  2  ^  '  dv  ^  2 

we  obtain  an  equivalent  relation  which  is  sufficient  for  the  right  hand  side  of  Eq.  (4.7) 


IiiaoboC^)  bo 
-  -  ^  ^  ^  \i  C  >1 


(4.18) 


From  the  definition  of  the  modified  Bessel  functions,  Ii/Iq  is  monotonic  increasing 
from  0  to  1  when  the  argument  increases  from  0  to  infinity,  so  it  suffices  to  solve  (4.18) 
for  C  —  1.  From  the  latter  argument,  and  since  it  is  necessary  that  the  slope  of  the  pro¬ 
bability  of  detection  functional  at  C  =1  be  non-negative,  this  becomes  a  necessary  con¬ 
dition  as  well.  By  continuity  and  monotonicity  of  both  sides  of  Eq.(4.18),  a  solution 


exists,  and  upon  fixing  the  false  alarm  probability  (6o  —  \/-2  log  ao)  and  by  virtue  of 
the  monotonicity  of  the  § -function  with  respect  to  Uq,  the  solution  of  Eq.  (4.18)  with 
equality  defines  a  critical  value  such  that  when  Qq)  =  Q(ao,  bo)  ^ 

inf  )0(/o,  q)  =  /3(/o)  ^  •  This  completes  the  proof  of  proposition  4.1 

9  €  Po 
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Remark  In  the  above,  we  have  assumed  independence  of  the  in-phase  and  quadra¬ 
ture  noise  variates.  However,  the  same  result  is  valid  also  if  they  are  circularly  sym¬ 
metric,  as  this  property  is  sufficient  for  asymptotic  independence  of  the  quadrature  M- 
estimates  (This  is  a  special  case  of  Appendix  D,  where  the  distribution  of  a  “Doppler- 
bank”  of  SSQME  tests  is  derived).  As  a  consequence,  the  SSQME  test  is  better  than  the 
“quasi  robust”  narrowband  test  that  has  been  proposed  by  Kassam  [12]. 

Some  numerical  values  of  are  given  in  Table  4.2.  Jt  is  seen  that  for  typical  false 
alarm  levels  ^  0.60. 


Sn 

10-^ 

10-2 

10"® 

10-^ 

10-^ 

10-® 

0 

ao 

2.41 

3.20 

3.86 

4.41 

4.92 

6.16 

— >  6f) 

=  Q  [Qq,  ^n] 

0.68 

0.64 

0.62 

0.60 

0.60 

0.59 

Q  f^n,  ^nl 

Table  4.2  -  Critical  values  of  /?(/o,  9o)  for  which  the 
SSQME  test  is  maximin  robust,  non-random  amplitude. 

We  note  that  the  test  in  [6]  for  a  completely  known  signal  is  asymptotically  maxi¬ 
min  robust  only  for  =  0-5  (regardless  of  oq),  a  fact  that  was  not  directly  stated 
there.  Closer  examination  of  the  situation  in  [6]  reveals  that  =  0-5  is  due  to  the 
symmetry  of  the  normal  p.d.f.  of  the  test  statistics  around  its  mean  under  both  Hq  and 
H i-  This  is  not  the  case  for  the  SSQME  test,  and  results  in  a  higher  value.  Neverthe¬ 
less,  when  designing  a  radar,  sonar  or  communication  system,  the  desired  goal  is  in  high 
quality  detection.  Consequently,  systems  are  specified  to  achieve  ,0=0.8-0.95,  and  the 
technical  parameters  which  result  in  the  required  SNR  are  designed  accordingly.  There¬ 
fore,  obtaining  the  robustness  of  the  SSQME  test  only  for  a  limited  range  of  /3  is  of  no 
practical  limitation. 

From  the  expression  for  0{lo,  ^o)  in  Table  4.1,  it  is  clear  that  the  maximin  bound 
is  only  slightly  inferior  to  the  detection  probability  in  the  uncontaminated  case,  where  ^ 
is  given  by  the  same  equation  but  with  a  =  A  \fn  /cr.  This  is  so  since  for  small  e  , 
rv* {e)  AV{Iq,  qo)lcr‘^  1,  as  can  be  seen  in  Table  4.3.  (See  also  [3,  p.  87]). 
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e 

0 

.001 

.002 

.005 

.01 

.02 

.05 

.1 

.15 

.2 

.25 

.4 

rv*(e) 

1 

1.01 

1.02 

1.04 

1.07 

1.12 

1.26 

1.49 

1.75 

2.05 

2.40 

4.0 

Table  4.3-  Upper  bound  on  the  estimation  variance  (and  upper  bound  on  the 
loss  in  the  effective  integrated  SNR),  rv*{e)  A  V[Iq, 

SSQME  test  for  narrowband  signal  with  non-random 
amplitude  in  nearly  Gaussian  noise. 

The  corresponding  maximin  bound  on  the  detection  probability  is  depicted  in  Fig. 
4.2  for  various  values  of  ao,  along  with  the  results  of  the  robust  test  on  the  envelope  of 
Chap.  3  (marked  RET).  All  the  curves  are  computed  with  fixed  input  SNR  =  A 
such  that  the  detection  probability  at  the  nominal  Gaussian  p.d.f.  for  all  ag’s  is  0.9. 
Very  similar  behavior  was  obtained  for  other  SNRs.  A  dramatic  improvement  in  perfor¬ 
mance  compared  to  the  RET  is  clearly  observed.  While  the  effectiveness  of  the  RET  is 
limited  to  contaminations  which  are  smaller  than  roughly  twice  the  desired  false-alarm 
probability,  (note  that  RET  curves  with  ag  <C  10  ^  are  outside  of  the  range  of  the  figure) 
the  SSQME  test  can  protect  against  e  which  is  of  orders  of  magnitude  higher  than  a^, 
before  the  detection  probability  decreases  substantially,  roughly  up  to  e  =0.1  -  0.2. 
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Figure  4.2-  Maximin  bound  on  the  detection  probability  vs.  €,  narrowband 
signal  with  non-random  amplitude  and  unknown  phase  in  nearly  Gaussian  noise. 
Full  lines:  SSQME  test.  Dotted  lines:  RET. 


€ 
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4.3  Random  Signal  Amplitude 

In  radar-sonar  applications,  modeling  targets  with  non-random  amplitude  is  an 
oversimplification.  ^  Since  in  nominal  Gaussian  noise  the  coherent  envelope  test  is  UMP 
regardless  of  the  signal  amplitude  statistics,  it  seems  natural  to  try  to  also  use  its  robust 
version,  the  SSQME  test,  for  random  amplitudes.  Therefore,  we  will  study  the  robust¬ 
ness  of  the  SSQME  test  by  averaging  the  asymptotic  power  of  the  deterministic  case 
with  respect  to  the  assumed  amplitude  p.d.f.  (Obviously,  the  false  alarm  probability  is 
bounded  as  before.)  Two  cases  will  be  treated. 

a)  Swerling  Case  1  Target. 

This  is  the  most  common  radar  target,  composed  of  many  independent  scatterers. 
The  amplitude  is  Rayleigh,  f  (A)  =  2A  exp(-yi  for  A  >  0,  A^  AEiA"^).  It 

turns  out  that  the  performance  of  the  SSQME  test  is  somewhat  less  desirable  than  that 
for  the  deterministic  signal  case. 

Proposition  4.2  Under  the  conditions  of  Proposition  4.1  and  a  Rayleigh  distributed  sig¬ 
nal  amplitude,  the  SSQME  test  with  the  nonlinearity  /q  of  Eq.(4.4)  satisfies 

subject  to  Eq.(4.8)  -  i.e.,  Qq  is  the  least-  favorable  p.d.f. 
under  Hq  but  the  most  -  favorable  under  Hi.  However,  when  the  nominal  noise  p.d.f.  in 
Pq  is  Gaussian  and  /g  is  the  soft  limiter  of  Eq.(4.9),  the  right  side  inequality  of  Eq.(4.7) 
can  be  replaced  by 

q)>  C  0{lo,  qo)  ,  0<c<l  (4.19) 

f  e  Po 

such  that  the  constant  C  (which  generally  depends  on  e,  and  q^))  approaches 
unity  as  £—*-0,  and  as  yS(/g,go  )— >1. 

Proof  Upon  averaging  with  respect  to  A  ,  the  power  of  the  SSQME  test  is  easily  found 
to  be 

'  Except  when  the  target  is  a  perfect  reflecting  sphere. 
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(3{l,  q)  =  exp 


-t{iy2V(l,  q) 
1  +  5(/,  q) 


(4.20) 


where  S  {I ,  q)  A  nX^ /2V[l,  q).  Thus,  with  the  threshold  i  (/ )  set  as  in  the  first  row  of  Table 
4.1, 


9o)  =  «o 


1/[1  +  5(1,  ?•)] 


(4.21) 


Clearly,  ^(/ ,  q^)  is  decreasing  in  V[l ,  q^),  hence  su^  ,  q^)  =  ^(/q,  q^).  Unfortunately, 
the  right  side  inequality  of  (4J)  is  never  satisfied  here: 


^(/o,  q)=  Oi 


y\r+s[i^ 

0 


?o*)l 


r 


A 


y{l^,  q) 

V{1„  q*o) 


<  1  V  7  e  Po 


(4.22) 


Thus  /?(/o,  go).  which  is  given  by  (4.22)  with  r  =  1,  is  always  higher.  This  is  actually  the 
second  case  that  was  discussed  in  Section  1.3. 


However,  the  SSQME  test  will  practically  be  “almost”  optimal  robust  (in  the  sense 
that  for  V  g  ^  Pq  performance  is  bounded  in  a  narrow  neighborhood: 

I  ^^0)  7o)~^(^0)?)  I  <  ®0)  ^  )  ”*■  0)  according  to  the  discussion  preceding  Fig.  1.2 

of  Section  1.3),  if  the  right  hand  side  of  the  maximin  relation  (4.7)  could  be  replaced  by 
Eq.(4.19)  such  that  the  constant  C  (which  generally  depends  on  e,  and  /3{Iq,  go))  is 
very  close  to  unity.  From  Eq.(4.22),  C  is  obtained  for  the  p.d.f.  for  which  r  achieves  its 
smallest  possible  value  over  Po-  Some  algebra  yields 


log  C  = 


1 _ log  «o 

1  “  '’min(^)  log  /?(/o,  go  ) 


log  P{lo,  90  ) 


(4.23) 


It  is  clear  from  this  expression  that  c  — *•!  monotonically  as  qjo— »-0  and  as  ►!. 

Some  basic  calculus  shows  that  this  convergence  is  also  true  when  /?(/o.7o)^l-  ^.t  this 
stage  we  restrict  the  class  Po  to  the  nominal  Gaussian  case.  Using  Eqs.  (4.9)  and  (4.11) 

K 

K'^Pj  {  I  o;  I  >  K}  +  f  x‘^q[x)dx 

-K 
K 

If  q(x)dxf 
-K 


V{lo,Q] 

q  =  (1  _  e)jg+  £A 


(4.24) 
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where  Qq  is  the  normal  p.d.f.  and  K[e)  is  given  in  (4.10).  It  is  not  difficult  to  see  that 
(4.24)  and  hence  also  r  are  uniquely  minimized  when  A  is  a  point  mass  at  the  origin,  as 
each  of  the  terms  in  the  numerator  achieves  its  minimal  value  and  the  denominator  its 
maximal  value;  using  Eq.(4.24)  with  (4.10)  yields 


^  min(^) 


inf 

=  (1  -  e)ffo  +  eA 


V{lo,  q) 

V{lo,  ?o) 


hence  r  1  as  e  — >•  0. 


l-eK^V{lo,  q*) 

[1+^  V{lo,  ?o)]' 


(4.25) 


e 

.001 

.01 

.1 

.2 

.3 

.5 

K{e) 

2.63 

1.945 

1.14 

.862 

.685 

.436 

«0 

^  min(^) 

.9910 

.9396 

.6108 

.349 

.177 

.028 

0.2 

.995 

.97 

.78 

.62 

.52 

.44 

10-2 

0*  ==  0.5 

.999 

.994 

.96 

.93 

.91 

.89 

0.9 

1-2-10-^ 

1-1  10-^ 

.999 

.998 

.998 

.998 

0.2 

.997 

.98 

.89 

.81 

.76 

.72 

10-" 

0*  =  0.5 

1-5-10^ 

.997 

.98 

.97 

.96 

.95 

0.9 

1-1 -10-^ 

1-7 -10-® 

1-5-10-^ 

.999 

.999 

.999 

10-® 

0.2 

.998 

.99 

.93 

.88 

.84 

.81 

0*  =  0.5 

1-3  10-^ 

.998 

.986 

.98 

.97 

.96 

0.9 

1-7  10-® 

1-5 -10-® 

1-3-10-^ 

1-5  10-^ 

1-7 -10-“^ 

.999 

0.2 

.9987 

.991 

.95 

.91 

.88 

.86 

10-® 

0*  =  0.5 

1-2- 10-^ 

.998 

.99 

.98 

.98 

.97 

0.9 

1-5  10-® 

1-3  10-® 

1-2  10-^ 

1-4  10-^ 

1-5  10-^ 

1-6-10-^ 

Table  4.4  -  The  constant  C  such  that  /3(/o,  q)  >  C  /3{lo,  q^), 
V  9  €  Pq-  SSQME  test  for  Sw.l  target 
in  nearly  Gaussian  noise. 
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The  computation  of  (4.23)  with  (4.25)  is  summarized  in  Table  4.4.  We  observe  that 
for  the  typical  small  ag’s  and  large  /?o’s  desired  in  radar-sonar  systems,  C  is  almost 
indistinguishable  from  unity,  even  for  very  large  contamination  e.  Thus,  for  any  g  €  Pq 
and  for  the  the  range  of  practical  interest,  the  asymptotic  power  is  almost  equal  to  the 
maximin  bound,  and  the  SSQME  test  is  “almost  robust”  as  conjectured.  As  in  the 
deterministic  amplitude  case,  the  performance  of  the  SSQME  test  is  characterized  by  the 
identical  decrease  in  the  effective  SNR  S{Iq,  qQ)/{nA^/2a‘^),  which  is  very  small  for 

small  e.  Thus  the  qualitative  behavior  as  in  Fig.  4.2  is  repeated  here,  with  dramatic 

(asymptotic)  improvement  over  the  test  of  Chap.  3. 

Further  thought  reveals  that  this  “almost  robustness”  is  a  consequence  of  always 
using  the  same  fixed  threshold.  If  it  were  also  possible  to  precisely  estimate  the  variance 
of  the  test  statistic  under  Hq  and  to  adjust  the  threshold  accordingly: 

)=-21nQ'oF(go  )=“21naoF(g  )=-21naoV'(g  ),  then  we  would  obtain 

*)  Vg  6  Pq.  Since  in  practical  applications  almost  always  the 

scale  (variance)  of  the  nominal  noise  p.d.f.  is  also  unknown,  adaptive  thresholding  will 

be  unavoidable  in  conjunction  with  the  robust  structure  to  maintain  CFAR.  This  will 
be  treated  in  more  detail  in  Section  5.2 

Comparing  the  known  lowpass  signal  [6]  and  the  non-random  amplitude  cases  with 
the  Rayleigh  amplitude  case,  some  thought  suggests  that  the  power  fails  to  satisfy  the 
desired  maximin  relation  exactly,  due  to  an  absence  of  a  mode  in  the  p.d.f.  of  the  test 
statistic  T[Iq)  in  the  present  case.  To  verify  this,  we  also  consider  the  following  case 
which  does  possess  a  mode,  and  represents  some  physical  targets. 

b)  Swerling  Case  3  Target. 

This  model  corresponds  to  one  dominant  reflector  plus  Rayleigh;  the  amplitude 
p.d.f.  is  f  {A)  =  8A^exi^{-2A'^/ A^)/ for  A  >0  [17].  In  this  case,  the  desired 
behavior  of  the  deterministic  amplitude  case  is  possible. 
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Proposition  4.3  Proposition  4.1  is  valid  for  a  Swerling  3  target  model. 

Proof  The  power  of  the  SSQME  test  is  found,  in  a  manner  similar  to  [13],  by  averaging 
the  non-random  amplitude  result  with  respect  to  the  above  amplitude  p.d.f.,  and  is  given 
by 


I3{1,  q)  = 


\  ,  t{l)/2V{l,q) 

'-t{l)/2V{l,  q)  ■ 

(l  +  5(/,g)/2)(l+2/5(/,y)_ 

exp 

1  +  S{1 ,  q  )/2 

(4.26) 


With  the  threshold  set  according  to  the  known  p.d.f.  q^, 


qo)  = 


1  - 


log  0!oS{l,  qo)/2 


+  q^)/2f 

Denoting  S{1 ,  qo)/2  =  y  and  differentiating, 

,  ? 0  )  -log^o  ) 


ao 


l/[l+5(/,  ?*)/2| 


dy 


(1+2/)^ 


[2(1+2/)  -  2/log«o 


which  is  positive  for  all  y  >  0,  hence  sup  /?(/,  y*)  =  I3{Iq,  q*o)  follows  by  virtue  of 


(4.12).  Also,  with  «(/o)  =  -2V{Iq,  gro*)logao. 


/?(/o,  ?)  = 


log  ao  5(/o,  yo  )/2 


a. 


^/[r+S{lo.  ?o*)/2] 


(4.27) 


{r  +S{lo,  q;)/2f 

where  r  A  V{Iq,  q)/V[lQ,  q^)  <1  as  before.  Denoting  S{lQ,qo)/2  =  x  and 
differentiating  with  respect  to  r ,  yields 

d^jlg,  q)  ao^/^'‘"^^hogao  g 


[x^+  logcvo  X  -  r‘ 


(4.27a) 


dr  {r+xf 

Non-negativity  of  the  term  in  square  brackets  for  all  r  <1,  is  sufficient  for 
/^(^0»  q)  A.  0{^O)  qo)-  Clearly,  from  the  solutions  of  the  quadratic  equation  in  x  the 
positive  root  has  to  be  taken,  hence 

(4.28) 


S{lo>  qo)  >  Sc  =  max  {-log  ao  +  A/log“ao  +  4r^  }  =  -logorQ  +  -y/log^oro  +  4 
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From  (4.27a),  it  is  clear  that  (4.28)  is  also  necessary  for  the  right  hand  side  of  the 
saddle-point  equation.  (If  it  was  not,  /9(/o,  q)  would  have  decreased  at  g  =  q^). 

The  critical  values  are  given  in  Table  4.5;  they  are  somewhat  higher  than  those  of 
the  deterministic  amplitude  case,  but  still  in  the  range  of  practical  interest.  Since  the 
one-dominant-plus-Rayleigh  p.d.f.  corresponds  to  Chi-squared  p.d.f.  of  with  2  degrees 
of  freedom,  while  the  non-random  amplitude  can  be  regarded  as  the  limit  of  the  Chi- 
squared  family  when  the  number  of  degrees  of  freedom  N  tends  to  infinity,  it  is  conjec¬ 
tured  that  whenever  the  amplitude  squared  belongs  to  this  family  but  N  I  (Ray¬ 
leigh),  there  exists  a  such  that  when  /3(/o,  ?o)  ^  Pc  ^^e  SSQME  test  is  asymptoti¬ 
cally  maximin  robust,  and  I3(j  decreases  when  N  increases.  It  seems,  however,  that  each 
case  needs  a  separate  derivation  to  prove  this  conjecture. 


«0 

10'^ 

10“2 

10“^ 

10-^ 

10-® 

10“® 

10“® 

5.36 

9.70 

14.1 

18.7 

23.2 

27.8 

36.9 

0.78 

0.75 

0.743 

0.740 

0.738 

0.738 

0.737 

Table  4.5  -  Critical  values  of  /9(/o,  go  )  for  which  the 
SSQME  test  is  maximin  robust,  Swerling  3  target. 

4.4  Relationship  with  Weak  Signal  LORD 

For  a  given  observation  sequence,  the  M-estimates  defined  by  Eq.  (4.2)  are  fixed 
numbers;  they  are  expected  to  be  close  to  zero  under  Hq  and  also  under  when  A  is 
small,  due  to  the  robust  properties  of  the  estimator.  Thus,  an  approximation  is  obtained 
by  expanding  the  summands  of  (4.2)  in  a  Taylor  series  in  powers  of  A  ,  and  then  solving 
(4.2)  using  only  the  two  leading  terms.  This  yields 

E  ^  ) 

^  E  K-.) 

t=i 


(4.29) 
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The  second  approximation  is  justified  as  follows:  For  nominal  Gaussian  p.d.f.  with 
the  optimal  robust  nonlinearity  Iq,  the  denominator  just  equals  the  number  of  a;,-  which 
are  in  [— iir(€),  it  is  quite  close  to  n  when  e  is  small  (since  FC  (e)  is  then  larger  than 

the  variance  of  the  samples),  and  only  weakly  changes  for  different  realizations  of  x. 

All  of  the  locally-optimal  robust  detectors  that  have  been  studied  in  [4],  [5],  [9],  [10] 
and  [l2|  are  based  on  this  test  statistic,  and  thus  they  all  hide  an  ”  approximate  robust 
estimator  of  zero.”  It  is  worthwhile  to  note  that  the  weak  result  of  [4]  and  [5],  namely, 
the  LORD  satisfies  the  maximin  relation  Eq.  (1-10)  on  the  slope  of  the  power  function  at 
A  =  0  only  for  a  >  Q:niin(e)  is  actually  a  consequence  of  this  approximation.  (It  should 
be  noted  that  this  lower  bound  is  much  larger  than  false-alarm  levels  of  any  practical 

E  ^  (A  1 

interest).  Though  the  incremental  SNR  =  lim  - -  f  is  identical  with  (4  16a] 

A  -  0  VAR  (A)  V  • 

the  numerator  and  denominator  are  somewhat  different  from  those  stated  in  Lemma  1 

^i(^)U^o  =  A  F^(r  )  +  0(a2);  FAR(i)U=o=^^,(/")  (4.30) 

Thus  there  are  actually  two  related  optimization  problems  (for  a  and  ^  ),  which  are  not 
parametrized  by  the  same  functional  V(^l,q'j  as  in  the  SSQME  test;  this  leads  to  a 
different  condition  that  must  be  satisfied  for  Eq.(l.lO)  to  hold  -  see  [4],  [5]. 

The  same  approach  could  be  taken  for  our  problem.  The  structure  of  Eqs.  (4.3)- 
(4.6)  is  preserved,  but  the  M-estimators  are  replaced  with  A  of  (4.29).  The  locally- 
optimal  robustness  can  be  verified  using  similar  techniques  as  those  in  [4].  Under  the 
regularity  conditions  stated  in  [4],  which  require  more  than  the  conditions  in  (4.1),  the 
central  limit  theorem  is  satisfied  and  the  detection  probability  is  again  Marcum’s  Q- 
function  where  the  parameters  {a ,  b  }  are  modified  according  to  (4.30).  Using  the 

expression  for  —  Q (a,  6)  in  Eq.  (4.17),  along  with  /i(0)  =  0,  If  (0)  ^  0,  it  can  be 

shown  that  the  right  side  inequality  of  Eq.  (1.10)  is  satisfied,  and  thus  the  detector  is  LO 
robust,  for  a  limited  range  in  the  false  alarm  probability  given  by 
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Q'o  >  Q^minW  =  min  [d{q)e{q)]  2/l«(?)-l|  (4  31) 

?  €  Pq 

where 


and 


d{q)M 


E,{lo'  ) 


d{q)  >1  \iq  ePo 


(4.32) 


eiq)^ 


EAii) 


e  (9)  >  1  \f  q  ePo 


(4.33) 


where  /g  = -(/n  gg)'  is  the  locally-optimal  nonlinearity.  The  inequality  in  (4.32)  is 
valid  since  Ig'  is  zero  over  the  interval  where  the  least-favorable  p.d.f.  q-g  of  Eq.  (4.13) 
puts  all  of  its  contamination;  the  one  in  (4.33)  is  valid  due  to  the  last  mentioned  fact  on 
qo  and  to  the  additional  regularity  requirement  that  /g  is  monotonic  nondecreasing. 
Because  of  this,  the  false  alarm  bound  of  Eq.  (1.3)  is  obtained  also  here  for  V  a  6  (0,  1), 
but  not  by  the  same  derivations  as  for  the  SSQME  test.  Utilizing  these  inequalities,  it  is 
easily  verified  that  amin(e)  G  (0,  1).  However,  we  have  not  solved  (4.31)  to  see  if  the 
actual  values  are  small  enough  to  be  acceptable,  as  we  believe  that  the  SSQME  test  is  a 
preferred  solution;  it  is  neither  restricted  by  any  Qfmin(€)  nor  limited  to  weak  signals.  This 
conclusion  has  been  verified  by  simulation  results,  to  be  presented  in  Section  5.4 
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5.  ROBUST  DETECTORS  FOR  A  REALISTIC  ENVIRONMENT 

The  derivation  of  maximin  robustness  of  the  SSQME  test  in  Chap.  4  was  based  on 
several  major  assumptions,  all  of  which  have  been  required  for  previous  studies  of  robust 
detection  [4]-[l2].  The  following  were  assumed  as  completely  known:  a)  the  percentage 
of  contamination  e,  b)  the  nominal  p.d.f.  in  the  class  Pg;  specifically,  the  variance  was 
assumed  known  for  the  nearly  Gaussian  case,  c)  the  frequency  of  the  signal  to  be 
detected.  Such  knowledge  concerning  the  detection  environment  is  usually  absent  in 
any  realistic  application  where  robust  procedures  are  needed.  This  is  particularly  true 
under  the  detection  environment  of  radar  (sonar)  systems  where  the  clutter  (reverbera¬ 
tion)  processes  rapidly  change  in  time  and  space.  This  section  extends  the  utility  of  the 
SSQME  test  for  such  situations,  and  ends  with  a  finite-sample  Monte-Carlo  analysis. 

5.1  Sensitivity  to  Unknown  e 

In  many  previous  investigationss  [4]-[l2]  the  amount  of  contamination  e  was 
assumed  known.  Clearly,  this  might  not  hold  for  a  nonstationary  and  nonhomogeneous 
detection  environment.  Theoretically,  when  any  knowledge  of  e  is  lacking,  the  mixture 
family  is  no  longer  convex  and  is  not  defined  well  enough  to  allow  an  optimum  robust 
test.  In  practice  though,  it  might  be  possible  to  bound  based  on  physical  con¬ 

siderations.  (If  emax=l)  the  robust  estimators  of  the  SSQME  test  reduce  to  median  esti¬ 
mators). 

An  obvious  but  apparently  unstated  observation  is  that  any  density  of  the  form 
/  {x  ■,e)  of  Eq.  (2.3a)  belongs  to  the  mixture  family  Po(eniax)  if  f  ^ ^^max- 

(Ue)/  Q+eh  =  (l-Ctnax)/  0+^max[(I - ~)  f  fld - - — ^  ]  =  ■  •  •  (5.1) 

^max  ^max 

*  ‘  '  =  (l“^max)/  0+^max^  ^  ;  (/  ^  €  M 

and  the  expression  in  square  brackets  /i '  is  a  legitimate  density  because  it  is  positive 
(l-€/€jnax^0)  integrates  to  unity. 
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Table  5.1  shows  the  asymptotic  variances  of  the  M-estimators  of  Eq.(4.2)  when  the 
input  noise  density  is  a  Gauss-Gauss  mixture  and  is  the  variance  of  the  contaminat¬ 
ing  density.  They  were  computed  from  Eq,(4.11)  with  the  soft-  limiter  of  (4.9-4.10).  In 
each  sub  table  the  limiter  break-point  k  is  designed  according  to  e^iax?  the  variances 
are  given  for  Also  indicated  in  the  headings  are  the  upper  bounds  on  the  estima¬ 

tion  variance  computed  with  the  least-favorable  density.  It  is  seen  (and  can  be  shown 
analytically  using  the  properties  of  the  Gaussian  c.d.f.  when  solving  (4.11)  in  conjunction 
with  (4.14))  that  the  variance  monotonically  increases  with  e  and  c  to  the  minimax 
bound.  A  similar  behavior  occurs  with  the  detection  probability  since  it  is  a  monotone 
function  of  the  effective  SNR  Eq.(4.16a),  which  is  inversely  proportional  to  the  estima¬ 
tion  variance. 

The  main  conclusion  from  the  table  is  that,  even  when  the  test  is  designed  for  large 
^maxj  variance  is  much  smaller  when  actually  An  even  more  important  con¬ 

clusion  is  drawn  from  Table  5.2,  which  shows  a  very  pessimistic  case:  the  test  is  designed 
with  k  (ejj^3^  =  0.5),  and  the  entries  in  the  table  are  the  ratio  of  variances  between  those 
obtained  with  this  design,  and  those  that  could  be  obtained  if  e  was  actually  known  and 
A;(£)  was  used-  Var  {k[e^^,e)/Var  {k{e),e).  It  is  clearly  seen  that  even  for  €=0.001  the 
efficiency  loss  is  merely  about  28%.  This  is  translated  to  an  increase  of  only  1.07  dB  in 
the  SNR  required  to  achieve  the  same  .  When  can  be  more  tightly  bounded,  the 
difference  is  even  smaller. 

Similar  behavior  is  found  when  the  contaminating  density  is  the  longer  tailed 
Laplace  density,  as  can  be  seen  by  dividing  the  entries  of  Table  5.3a  by  the  correspond¬ 
ing  ones  in  5.3b.  (Here  however,  the  variance  is  not  monotone  with  e  for  small  c  ). 

To  summarize  this  discussion,  we  have:  A  design  with  preserves  the  maximin 
robustness  properties  for  \/  the  loss  incurred  from  not  knowing  the  actual  e 

is  shown  numerically  to  be  reasonably  small  for  the  Gauss- Gauss  and  Gauss-Laplace 
mixture  families  considered. 
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Table  5.1  Estimation  variances  with  fixed  limiter 
Gauss— Gauss  mixture. 


EPSMAX=0. 

005  K(EPSMAX)=  2.160 

VMAX(K) 

=1.037 

F=(l 

-EPS)*N(0, 

, 1)+EPS*N(0, 

C**2) 

EPS/C 

1 

3 

10 

30 

100 

0.0010 

1.0066 

1.0097 

1.0116 

1.0123 

1.0125 

0.0020 

1.0066 

1.0127 

1.0167 

1,0180 

1.0184 

0.0030 

1.0066 

1.0158 

1.0218 

1.0237 

1.0243 

0.0040 

1.0066 

1.0189 

1.0269 

1.0294 

1.0303 

0.0050 

1.0066 

1.0220 

1,0320 

1.0351 

1.0362 

EPSMAX=0.010  K(EPSMAX)=  1.945  VMAX(K)=1 . 065 
F=(1-EPS)*N(0, 1)+EPS*N(0,C**2) 


EPS/C  1  3  10  30  100 

******ir****tlr****************i^***i<r*********^*  **********  ★*★■***** 


0.0020 

1.0121 

1.0177 

1.0210 

1.0220 

1.0224 

0.0040 

1.0121 

1.0233 

1.0300 

1.0321 

1.0328 

0.0060 

1.0121 

1,0289 

1.0391 

1.0422 

1.0432 

0.0080 

1.0121 

1.0346 

1.0482 

1.0523 

1.0538 

0.0100 

1.0121 

1 . 0403 

1.0573 

1.0626 

1.0644 

EPSMAX=0. 

050  K(EPSMAX)=  1.399 

VMAX(K) 

=1.256 

F=(l 

-EPS)*N(0. 

1)+EPS*N(0, 

C**2) 

EPS/C 

1 

3 

10 

30 

100 

0.0100 

1.0467 

1.0695 

1.0807 

1 . 0840 

1.0852 

0.0200 

1.0467 

1.0928 

1.1157 

1.1226 

1.1250 

0.0300 

1 . 0467 

1.1165 

1.1518 

1.1625 

1.1662 

0.0400 

1.0467 

1.1408 

1.1890 

1.2037 

1.2088 

0.0500 

1 . 0467 

1.1655 

1.2274 

1.2464 

1.2530 

★★★★★★★★*★*■***★****★★★ 

max* 
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Table  5.1  (continued). 


Table  5. 1  (continued). 

F=(1-EPS)*N(0, 1)+EPS*N(0,C**2) 


EPS/C 

1 

3 

10 

30 

100 

0,0200 

1.0812 

1.1239 

1.1433 

1.1490 

1.1509 

0.0400 

1.0812 

1.1685 

1.2092 

1.2214 

1.2254 

0.0600 

1.0812 

1.2149 

1.2793 

1.2989 

1.3053 

0.0800 

1.0812 

1.2634 

1.3539 

1.3817 

1.3909 

0.1000 

1.0812 

1.3139 

1.4333 

1.4706 

1.4829 

EPSMAX=0. 

200  K<EPSMAX)=  0,862 

VMAX(KJ 

=2.046 

Fs(l 

-EPS)*N(0, 

1)+£PS*N(0, 

C**2) 

EPS/C 

1 

3 

10 

30 

100 

0 . 0400 

1.1393 

1.2221 

1.2574 

1.2678 

1.2706 

0.0800 

1.1393 

1.3121 

1.3904 

1.4142 

1.4208 

0.1200 

1.1393 

1.4i02 

1.5411 

1.5818 

1.5933 

0.1600 

1.1393 

1.5175 

1.7125 

1.7740 

1.7928 

0.2000 

1.1393 

1.6350 

1.9083 

1.9982 

2.0249 

EPSMAX=0.30Q  K(EPSMAX,«  0.685  VMAX( K)=2 . 822 
F={1-EPS)*N(0, 1)+EPS*N(0,C**2) 


EPS/C 

1 

3 

10 

30 

100 

******************** 

***************** 

********** 

*********' 

0.0600 

1.1916 

1.3162 

1.3684 

1.3038 

1.3870 

0.1200 

1.1916 

1.4580 

1.5802 

1.6177 

1,6260 

0.1800 

1.1916 

1.6200 

1.0367 

1.9058 

1.9220 

0.2400 

1.1916 

1.8061 

2.1508 

2.2655 

2.2941 

0.3000 

1.1916 

2.0214 

2.5405 

2.7219 

2.7690 

EPSMAX=0.400  K(EPSMAX)=  0.555  VMAX(K)=3 . 996 
F=(1-EPS)*N(0, 1)>EPS*N(0,C**2) 


EPS/C  1  3  10  30  100 

***«^********  *****************************************  ********* 


0.0800 

1.2395 

1.4087  1.4794 

1.5007 

1.5061 

0.1600 

1.2395 

1.6100  1.7855 

1 . 8409 

1.8557 

0.2400 

1.2395 

1.8518  2.1838 

2.2945 

2.3256 

0.3200 

1.2395 

2.1455  2.7140 

2.9160 

2.9761 

0 . 4000 

1.2395 

2.5064  3.4395 

3.7973 

3.9099 

***'***'************************************^*^«iii^^« 

************ 

EPSMAX=0. 

500  K(EPSMAX)=  0.436 

VMAX(K)= 

•5.928 

F=(l 

-EPS)*N(0, 1)+EPS*N(0, 

C**2) 

EPS/C 

1 

3  .  10 

30 

100 

0.1000  1.2916  1.5086 

0.2000  1.2916  1.7789 

0.3000  1.2916  2.1211 

0.4000  1.2916  2.5626 

0.5000  1.2916  3.1447 

***********************^*^*, 


1.5999  1.6267  1.6294 
2 . 0195  2 . 0949  2 . 1047 
2.6094  2.7747  2.8021 
3.4734  3.8124  3.8808 
4.8072  5.5038  5.6726 
************************^*, 
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Table  5.2  ratio  of  estimation  variances  between  designs  with  fixed  k[ 
and  variable  A;  (e), Gauss— Gauss  mixture. 


VAR(K(EPSMAX) )/VAR(K(EPS) )  EPSMAX=0.500  K(E  PSMAX)= 


F=(1-EPS)*N(0, 1)+EPS*N(0,C**2) 


EPS/C  1  3  10  30  100 

*★******•****★*****★**★*******•**********-********************** 


0.0010 

1.2897 

1.2868 

0.0050 

1.2832 

1.2734 

0.0100 

1.2762 

1.2605 

0.0500 

1.2340 

1.1963 

0.1000 

1.1946 

1.1482 

0.2000 

1.1337 

1.0880 

0.3000 

1.0839 

1.0493 

0.4000 

1.0420 

1.0224 

0.5000 

1.0000 

1.0000 

1.2838 

1.2828 

1.2824 

1.2645 

1.2617 

1.2604 

1.2471 

1.2428 

1.2409 

l.i687 

1.1600 

1.1547 

1.1162 

1.1062 

1.0987 

1.0582 

1.0484 

1.0394 

1.0271 

1.0194 

1.0117 

1.0099 

1 . 0040 

1.0031 

1.0000 

1.0000 

1.0000 

Table  5.3  Estimation  variances  for 


Gauss-Laplace  mixture,  a)  Fixed 


b)Variable  ^(e) 


EPSMAX=0.500  K(  EPSMAX)=  0.436  VMAX(  K  )=:5 . 928 


F=(1 

-EPS)*N(0, 

1 )+EPS*LAPLACE(0,C**2) 

EPS/C 

1 

3 

10 

30 

100 

************************************************************* 

0. 1000 

1 . 1863 

1.4343 

1.5708 

1.6172 

1.6379 

0.2000 

1.0922 

1.5996 

1.9403 

2.0676 

2. 1252 

0.3000 

1.0076 

1.7924 

2.4425 

2.7139 

2.8392 

0.4000 

0.9314 

2.0189 

3.1478 

3.6848 

3.9411 

0.5000 

0.8626 

2.2874 

4. 1795 

5.2347 

5.7638 

************************************************************* 

VARI 

ANCES  WITH 

OPTIMAL 

K( EPS) 

F=(1 

-EPS)*N(0, 

1 )+EPS*LAPLACE(0,C**2) 

EPS/C 

1 

3 

10 

30 

100 

^#4f4f.)f.)f.)f.}f^^^4f^^^4f#4f******************************************* 

0.1000 

1.0438 

1.2455 

1.3979 

1.4570 

1 .4799 

0.2000 

1.0318 

1.4711 

1.8222 

1.9647 

2.0244 

0.3000 

0.9914 

1.7138 

2.3675 

2.6536 

2.7837 

0.4000 

0.9327 

1 .9821 

3.1101 

3.6591 

3.9318 

0.5000 

0.8626 

2.2874 

4.1795 

5.2347 

5.7638 

.H-***4f**#4f*4f  ****#**■«**•«•#■«•*■«>■«•  ■H>4f  *********** -tf-tf  ***  *#****•«■■«■ 


max — 0-S) 


0.436 
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5.2  Robust  Scale  Invariant  Tests 


The  scale  a  of  the  nominal  density  in  the  e-mixture  family  affects  the  implementa¬ 
tion  and  performance  of  the  SSQME  test  in  two  ways.  First,  a  must  be  known  for  con¬ 
struction  of  the  optimal  robust  M-estimates,  as  the  break  points  of  the  nonlinearity  are 
functions  of  it,  Eq.(4.14);  e.g.,  they  are  equal  to  ±<7^  (e)  for  the  nominal  Gaussian  case. 
Second,  the  detection  threshold  setting  ^(fo)  must  be  proportional  to  to  obtain 

the  desired  Oq  level,  (see  Table  4.1),  and  V  itself  is  proportional  to  cr^.  Even  if  the 
optimal  M-estimators  are  replaced  by  some  other  robust  and  scale  invariant  estimators 
of  the  quadrature  locations,  the  second  crucial  problem  must  still  be  addressed. 

In  the  specific  case  of  nominal  Gaussian  density,  which  is  the  main  interest  of  our 
work,  the  first  problem  is  solved  together  with  a  substantial  simplification  in  implemen¬ 
tation  complexity  by  utilizing  Tukey’s  alpha-trimmed  mean  estimator  instead  of  the  M- 
estimator.  This  estimate  is  defined  by: 


1 


n  (l-2a) 


n  -l-\n  a] 


E  +  (l+[na]-na)[2:(i+|„„|  -P 


i  — 2+(n  a 


(5.2) 


where  are  the  ordered  samples:  a;(i)<a;(2)< . and  [na]  is  the  greatest  integer 

in  n  O'  for  0<a<0.5.  The  estimator  simply  deletes  [n  a\  samples  from  each  end  and  then 
takes  the  weighted  mean  of  the  remaining.  Its  advantage  over  Huber’s  M-estimator  is 
that  solving  an  implicit  nonlinear  equation  is  not  necessary.  With  the  above  weighting  it 
is  translation  invariant  for  all  n  (and  unbiased  whenever  Ef  {x^)  exists).  Bickel  [33] 
showed  that  it  is  asymptotically  Gaussian  and  its  variance  for  symmetric  /  is 


Var  {'/^X^)Acr^. 


2 


^  l-Qf 

j  x^f  {x)dx+axi^ct 

0 


(5.3) 


where  F{xi_a)=l-a.  The  two  estimators  are  quite  similar  for  proper  choice  of  a  and 
k  (e);  they  both  sum  linearly  the  main  bulk  of  the  samples,  but  Huber’s  estimator  treat 
large  samples  and  outliers  by  limiting  them,  while  the  a-trimmed  estimator  censors  them 
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completely.  The  main  difference  is  that  for  small  n  the  a  -trimmed  will  always  discard 
some  of  the  observations,  even  if  they  all  are  small  and  come  from  the  nominal,  while 
the  other  might  not.  For  n-*Qo,  however,  they  are  equivalent  in  quadratic  mean.  The 
variance  of  Huber’s  estimator  with  the  soft-  limiter  nonlinearity  /o(a;  ;-k,k)is 


)  = 


k 


2[Jx^f  {x)dx  +  k^F{-k)\ 


[l-2F{-kf 


(5.4) 


This  is  identical  to  (5.3)  when  one  choses  Q;(e)=F  (-it  (e)).  Hence  they  both  have  the 
same  asymptotic  distribution  with  fixed  F .  Moreover,  the  trimmed-mean  estimator  is 


also  minimax  for  e-mixture  family  with  Gaussian  nominal  [3],  hence  it  can  be  substituted 
in  the  SSQME  test  without  change  in  its  properties. 


Two  different  asymptotically  scale  invariant  extensions  of  the  SSQME  test  are  next 
discussed.  The  difference  between  them  depends  on  the  availability  of  a  noise- reference” 
channel. 


a)  “Sliding- Window”  robust  quadrature  test 

We  follow  the  definitions  and  the  physical  setup  of  section  2.2.  In  addition  to  the 
“test-cell”  samples  {x,y},  there  are  also  available  M  “noise-  reference”  vectors  of  obser¬ 
vations:  {«,-y  },2  =  ,j  =  1,...,M.  uy  =1,...M  are  statistically 

independent  vectors  that  are  identically  distributed  as  x  is  under  Hq,  and  likewise  for 
vy  and  y.  An  adaptive  threshold  test  is  constructed  by  using  the  SSQME  structure  in 
both  the  “test-cell”  statistic  R(x,y)  and  in  the  threshold  estimator  l'F(u,v)  of  Eqs.(2.8)- 
(2.11).  Specifically,  the  test  is 

DECIDE  Hi  iff  Ra{^,y)  >  t{M)W^{n,v)  (5.5) 

where  R  a{'x.,y)  is  identical  with  T„{lo)  of  Eq.(4.3)  with  the  M-estimators  and  Ay 

replaced  by  the  a~  trimmed  means  of  the  x  and  y  samples,  respectively.  In  a  similar 

M  _  _  _ 

manner,  W,,(u,v)=(l/M)  ^  where  a==  +  V/a  i  and  Fy„ 

;=1 


are  the  a- 


trimmed  means  of  Uy  and  vy ,  respectively.  The  test  structure  is  shown  in  Fig.  5.1. 


Fig.  5.1  Sliding-Window  scale  invariant  SSQME  robust  test 


TEST- CELL 
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We  can  summarize  the  properties  of  this  test  in  the  following  proposition: 

Proposition  5.1  Consider  the  detection  problem  with  the  observables  of  Eq.(2.7),  with 
the  uncertainty  in  the  distribution  of  the  quadrature  noise  samples  obeying  an  e-mixture 
model  Pq^,  where  the  scale  a  of  the  nominal  Gaussian  distribution  is  unknown.  Let 
be  the  class  of  all  hypothesis  tests  of  the  form  R  [T Ty)> 

=1,  ■  ■  ■  M)  where  W  and  R  are  as  in  Eq.  (5.5),  with  the  or-trimmed 

estimators  replaced  by  any  other  asymptotically  normal,  translation  invariant  and  scale 
equivariant  statistic  T  (i.e.,  T  (crx-|-a  )=<Tr  (x)-|-a ).  Let  d[a)  €D,vf  be  the  test  with  a- 
trimmed  estimators,  and  let  be  the  class  of  all  decision  rules  based  on  {T^,Ty}  -  i.e., 
without  imposing  the  structure  of  Eq.  (5.5)  and  precluding  reference  samples.  Then: 

a)  The  test  is  DF-CFAR  for  any  q  GPqo-  and 

Pf,={l+t{M)/M)-^  (5.6) 

b)  If  0'*(e)=F'o  (-^  (e)),  where  Fq  is  the  c.d.f.  of  Qq,  then  and  d*{Q:*{e))  are  a 
saddle-point  pair  in  (Pq^)  ^m)  ^be  asymptotic  detection  probability 


sup/5(9o,«^)  =  l3{qo,d*)  =  M^{q,d*) 

i.e.,  the  test  is  asymptotically  maximin  robust.  This  is  true  for  any  distribution  of  the 
amplitude  A  ,  or  for  any  amplitude  level  if  it  is  deterministic.  For  the  Rayleigh  signal 
case,  /3{qo,d*)  is  given  by 


^(90,  d*)^ 


1+ 


a 


n-M 


l+‘S'(9o  ) 


nA  ■ 


5’(9o)  =  ^/(9o*(^=1)) 


2(7“ 


(5.7) 


c)  When  also  M— >oo,  d*  is  equivalent  to  the  Neyman-Pearson  test  in  the  wider  class 
Dqo  for  assuming  that  a  is  known  and  only  {T^^Ty}  are  available  as 

observables. 

Remarks  1)  Unlike  the  previously  studied  SSQME  tests,  here  the  threshold  multiplier 
t{M)  is  not  a  function  of  the  variance  of  the  location  estimators  T,  hence  the  DF- 
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CFAR  property.  2)  Unlike  the  cases  of  Chap.  4,  the  lower  bound  on  the  detection  proba¬ 
bility  is  valid  for  all  A  >0;  in  particular,  this  test  is  also  maximin  for  the  important  case 
of  a  Rayleigh  signal,  in  contrast  to  the  non-adaptive  test  of  Section  4.3,  case  a).  As  will 
be  clear  from  the  proof  and  from  the  discussion  of  Section  1.3,  this  is  a  result  of  the 
CFAR  property.  3)  The  detectability  loss  incurred  by  the  adaptive  threshold  scheme  is 
typically  small  for  Af  >20,  cf.  [18-20]. 

Proof  a)  First  note  that  by  the  equivariance  assumption,  all  tests  in  Dm  are  scale 
invariant,  even  for  finite  n  .  By  asymptotic  normality,  the  unknown  estimation  variance 
V{qjT)  appears  as  a  common  positive  scale  factor  on  both  sides  of  Eq.(5,5),  and  can  be 
divided  out.  Define 


r„2(x)+r„“(y) 

R  =  ,  W-- 


1 

1  v-i  2 


S  [Tn  {^j)+Tn  (Vy)] 


2V{q,T)/n  ’  2V{q,T)/n) 

Clearly,  W  is  chi-squared  distributed  f  U  {w),  and 

under  Hq,  f  ^  {r)=e~'^  U{r).  Since  W  and  R  are  independent,  =Po{i?  > flF }  is 
simply  computed  and  given  by  (5.6). 

b)  Define  also  0=tan''^(  (y)/r,j  (x)).  With  this  transformation  the  density  of  R  is  com¬ 

puted  straightforwardly  as 


27r 


nA  “ 


f  R{r  \  A  ,  (f)= ^  f  n  Q{r  ,6  \  A  ,4i)d  d  = 

0  2F(g,r) 


\Io[A  (• 


2  nr 


Vlt.T) 


)‘'"l  =  /»(>■  I  A) 


and  in  a  manner  similar  to  the  derivation  of  Section  4.2, 


l3M{q,  T  \A,w)==^P,{R>tW  \A,  W=w}^Q  [A  )^/^  {2tw  )^/^] 

By  monotonicity  of  the  Q  function  with  respect  to  the  first  argument  when  the  second  is 
fixed,  and  since  A  is  positive,  a  minimax  property  on  V{q  ,T)  is  translated  to  a  maxi¬ 
min  property  on  the  conditional  detection  probability,  which  in  turn  imply  maximin  on 
the  unconditional  detection  probability  for  all  F  {A  ),  since 
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00  00 

T)==jdF{A)j^MU,  T  \  A,w)dF{w) 

0  0 

The  derivation  of  (5.7)  is  straightforward  with  substitution  of  F{A)  and  F{W];  the 
result  is  equivalent  to  the  pure  Gaussian  case  (compare  [18-20]),  if  the  effective  SNR  is 
changed  as  indicated  in  (5.7). 

c)  According  to  a  well  known  convergence  theorem,  if  {Ti,  Yo, . }  is  a  sequence  of 

r.v.’s  that  converge  in  probability  to  y  >0,  then  the  distribution  of  the  r.v.  Z„=X /Y^ 
converges  to  Fx{yz).  Clearly,  the  normalized  adaptive  threshold  estimator  W  converges 
in  probability  to  1  as  M—*co,  since  it  is  just  a  sample  mean.  Hence,  substituting  t{M) 
from  (5.6), 

lim  /3m(9,  T  I  A)  =  lim  P  f  M  }  =  jim  g  [A{—-P^—fl‘^,  (2i  (M)-!)^/^]  _ 

M-^oo  M-*oo  W  A/ -^00  V\qjl  ) 

“  M  “00*5  1-^  =  «  1>1  (-2108^/. 

where  the  last  limit  follows  from  the  dominated  convergence  theorem  and  from 
L’HospitaPs  rule.  Since  this  expression  is  identical  with  the  detection  probability  of  the 
optimum  asymptotic  detector  which  operates  on  the  transformed  input  observations 
(y)},  see  Section  4.2,  the  left  side  of  the  saddle-point  relation  is  extended  to 
the  wider  class  T>^,  In  particular,  notice  that  the  limit  {M  — >oo)  of  Eq.(5,7)  is  indeed  the 
(exponential)  detection  probability  of  the  Rayleigh  case. 

Remark  In  the  proof,  we  did  not  utilize  any  property  of  the  e-mixture  family. 
Therefore,  the  proposition  is  valid  for  any  test  based  on  asymptotically  normal,  shift 
invariant  and  scale  equivariant  estimators,  whose  variance  is  minimax  in  some  family  of 
distributions  P.  Some  examples  are: 

a)  The  set  of  all  distributions  differing  at  most  by  e  in  Kolmogorov’s  distance  from  the 
Gaussian  cumulative  of  known  scale,  and  the  corresponding  M-estimator  -  see  [3]. 
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b)  The  set  of  all  P-point  distributions  P={/  : / /  {x)dx=p  },  and  the  corresponding 

-a 

M-estimator  -  see  [6]. 

c)  The  set  of  all  distributions  with  equal  variance  -  this  problem  is  interesting  within 
the  context  of  optimal  minimax  jamming  .  It  turns  out  that  the  saddle-point  solu¬ 
tion  is  nothing  but  the  Gaussian  distribution  and  the  quadrature  test  based  on  the 
sample-means.  Since  the  saddle-point  (within  the  sub-optimal  set  of  SSQME  tests) 
happens  to  coincide  with  the  global  UMP  test  at  that  point,  it  is  also  the  minimax 
solution  in  the  set  of  all  tests  -  see  [45].  (We  also  show  there  that  the  same  is  true 
even  when  the  noise  quadrature  samples  are  not  assumed  a-priori  identically  distri¬ 
buted). 

d)  The  set  of  all  e-mixture  distributions  with  arbitrary  symmetric  nominal  /  o  of  unk¬ 
nown  scale.  Here,  the  a-  trimmed  mean  is  not  the  optimal  estimator,  although  it  is 
still  robust  in  Hampel’s  sense  as  will  be  discussed  in  the  following.  The  appropriate 
estimator  is  either  from  the  class  of  L-  or  R-estimators^.  As  before,  we  take  the 
optimal  estimator  for  the  same  least  favorable  p.d.f.  of  Eq.  (4.13).  The  location 
invariant  and  scale  equivariant  L-estimators  are  given  by  L„{x)= 

where  the  weights  are  generated  from  a  score  function  m(s)  according  to 

i/n 

o-ni  =  f  m(s]ds  (5.8) 

and  m(-}  is  related  to  of  Eq.(4.13)  through 


where 


m  (F 0  (x  ))= 


_1 _ ^ 

/(go)  dx^ 


(-log/  0(2^  ;i)) 


0 


I  ar  I  <  xi 
otherwise 


(5.9) 


^  The  SW  structure  with  R-estimators  is  an  overkill.  It  is  possible  to  obtain  DF-CFAR 
performance  and  maximin  detection  probability  with  a  quadrature  test  based  on 
robustified  rank  statistics.  This  subject  is  studied  in  detail  in  Chapter  6. 
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f  o{^i,l)/k-F4.-x,-\)=e/2[l-e)  ,  |  /  „'  {x i,l) / f  ^{x ^-l)  \  =k 

Notice  that  also  here  the  censoring  percentage  is  i^o(-2;i).  The  minimax  relation 
on  the  estimator’s  variance  remain  valid  (where  F(go  )=l//(9o  ))>  due  to  the 
asymptotic  equivalence  between  L  and  M-estimators  [Jaeckel,  46],  provided  the  a„,- 
are  chosen  as  above.  (Stigler  [47]  proved  later  that  if  the  nominal  p.d.f  is  any  sym¬ 
metric,  twice  differentiable  and  strongly  unimodal  p.d.f.  on  R'^,  the  L-estimator 
corresponding  the  above  score  function  is  indeed  asymptotically  Gaussian  with  the 
variance  assumed  in  [46]). 

To  summarize,  the  SW  adaptive  threshold  extension  of  the  SSQME  test  is  a  suit¬ 
able  choice  for  radar-sonar  systems  where  “noise-reference”  samples  are  conveniently 
available.  It  possesses  all  the  desired  asymptotic  maximin  properties;  its  implementation 
in  the  nominal  Gaussian  case  is  relatively  easy  based  on  a-trimmed  estimators  which  do 
not  require  more  than  ordering  of  the  data,  an  operation  that  is  common  to  most  non- 
parametric  tests  and  is  becoming  available  for  real  time  implementation  with  VLSI  tech¬ 
nology  [16]. 

b) SSQME  test  with  a  preliminary  scale  estimate 

We  assume  now  that  {x,y}  are  the  only  available  observables,  and  the  nuisance 
scale  estimate  must  be  derived  from  the  “signal  plus  noise  ”  data  itself  We  draw  and 
build  on  Hubers  theory  [3]  of  simultaneous  estimation  of  location  and  scale.  For  com¬ 
pleteness  of  the  presentation,  we  next  quote  and  make  some  interpretations  on  Huber’s 
main  results: 

i)  Simultaneous  M-estimates  for  location  and  scale  are  a  pair  of  statistics  (T„,S'„) 
determined  by  solving  equations  of  the  form 

In  radar/sonar  applications  in  a  nonhomogeneous  environment,  utilization  of  spatial 
adjacent  observations  (as  in  the  SW  structure)  is  not  justified  if  the  reference  observations 
are  not  identically  distributed  as  the  main  observations. 
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JL  Xi-T„  ^ 

- )  =  0  (5.10a) 

t=l 

E  )  =  0  (5.10b) 


These  are  similar  to  the  ML  estimates  of  0  and  a  for  a  family  of  densities 
1  X  —9 

— /  ( - ))  with  proper  selection  of  'I'  and  x-  As  most  common  test  statistics  and 

cr  a 

estimators  depend  on  the  samples  only  through  the  empirical  distribution  function 

-^n  (x  )■  {x,<x}!  where  is  the  indicator  function  of  the  set  A,  it  is 

convenient  to  express  r„  (x)  =  T{F„)  and  (x)  =  S{F„)  in  terms  of  the  func¬ 
tionals  defined  by: 


ln~^)F{dj)  =  o'  (5,iu) 

/x(  =“  0  (5.11b) 

Hampel’s  [34]  influence  curve  (IC)  is  a  very  useful  heuristic  tool  of  robust  statistics, 
■which  describes  the  (suitably  normed)  limiting  influence  of  a  single  observation  on 
the  estimator.  Intuitively,  a  qualitative  robust  estimator  must  have  a  bounded  IC. 
If  \fn  [T(F„)  -  T(F)]  is  asymptotically  zero  mean  Gaussian,  its  variance  is  given 
by  (using  (5.13)  below )- 


V{F,T) 


JlC\x-,F,T)F{dx)  =  S\F) 


(5.12) 


with  a  similar  expression  for  V(F  ,S).  If  (and  only  if)  F  and  x  s-re  symmetric,  and 
^  is  skew-symmetric,  the  IC’s  of  the  simultaneous  estimates  are  uncoupled;  in  par¬ 
ticular,  only  S(F)  but  neither  the  IC  nor  the  asymptotic  variance  of  S  enters  into 
the  expression  of  IC(T).  The  IC’s  are  given  by: 


S(F) 


■}S(F) 


fn 


S(F) 


■)F  {dx ) 


IC{x-F,T) 


(5.13) 


IC[x-F,S) 


(5.14) 


X(' 


sm 


■)S{F) 


fx'(' 


-)- 


S(F)'S(F) 


F{dx) 


Note  that  for  a  given  distribution,  T(F)  and  S(F)  are  numbers;  and  are 
Fisher  consistent  estimates  if  T(F)  and  S(F)  are  equal  to  the  estimated  parameters. 
Under  the  above  mentioned  symmetry  conditions,  the  estimates  are  asymptotically 
uncorrelated  and  hence  independent. 

The  asymptotic  robustness  (in  Hampel’s  sense  )  is  clear  from  (5.13)-(5.14),  provided 
that  ^  and  x  are  bounded  functions.  In  Section  6.4  of  [3]  it  is  shown  that  under 
relatively  mild  conditions  the  coupled  estimates  are  consistent, 
(T'„  ,S„  )  — >  [T[F),S[F))  in  probability,  and  are  jointly  Gaussian. 

)  The  IC  of  T  for  the  case  when  the  nominal  scale  is  known  is  given  by  the  same 
expression  as  (5.13)  but  with  S(F)=1.  If  S(F)  does  not  change  too  much  when 
F  ElPq  (a  property  which  must  be  satisfied  for  any  reasonably  consistent  and  robust 
estimate  of  the  scale),  the  desired  properties  of  the  location  estimate  with  known 
scale  will  be  roughly  preserved.  However,  it  is  not  possible  to  obtain  an  exact 
minimax,  since  the  loss  of  the  translation  invariance  symmetry  in  the  multi¬ 
parameter  case  does  not  enable  extension  of  the  parametrization  of  the  nominal 
model  throughout  a  convex  uncertainty  neighborhood,  cf.  [3,  Section  11.1].  It 
should  also  be  clear  that  the  variational  techniques  of  [3]  are  not  applicable  for 
finding  the  density  that  minimizes  V[F  ,T),  even  when  the  asymptotic  variance  of 
S  is  considered  as  a  nuisance  factor,  due  to  the  deep  and  implicit  nonlinear  coupling 
of  Eq.(5.12)  subject  to  Eqs.(5.11). 


)  The  simultaneous  solution  of  Equations  (5.10)  is  perhaps  an  unnecessary  compli¬ 
cated.  Simplified  variants  are  the  one  step  M -estimates.  They  are  obtained  by 
starting  with  some  preliminary  estimates  and  and  then  solving  Eq.(5.11a) 
approximately  by  applying  Newton’s  method  just  once  .  The  following  variant 


-  69  - 


emerged  in  the  extensive  “Princeton  Monte-Carlo  study  ”  of  Andrews  et  al.  [35]  as 
very  robust,  even  for  small  sample  sizes.  The  preliminary  estimates  are  the  median: 
—  med  {xj  },  and  the  median  of  the  absolute  deviations  from  it  (MAD): 
~  tried  {  I  OTj  — I  }/0.6745,  where  0.6745=^>“^(3/4)  to  get  consistency 
at  the  Gaussian  {x{^)=sign{  \  x  \  -1),  thus  5(^\F)=F~^(3/4)  for  a  symmetric  den¬ 
sity  from  (5.11b)).  The  MAD  is  the  limiting  case  (e—^1)  of  the  minimax  robust  scale 
estimate,  and  its  computation  is  simpler.  Then, 


r„(°)(x)  + 


1  t.-TW 

^  5(0) 

_ _ 

1  2;._2’(0) 

^  5(0)  ^ 


(5.15) 


is  asymptotically  (n-^oo)  equivalent  to  the  full  solution  of  Eq.(5.10a) 
provided  the  previously  mentioned  symmetry  conditions  are  satisfied.  Moreover, 
with  these  symmetry  conditions,  any  one-step  estimator  with  translation  invariant 
and  odd  :  TJ-'^Xx+c  )=tJ-^\x)+c  ,  will  have  the  same  IC 

(and  hence  identical  asymptotic  properties)  as  the  full  solution  of  the  coupled  equa¬ 
tions.  Note  that  for  the  nominal  Gaussian  case  the  computation  of  (5.15)  is  of  the 
same  order  of  complexity  as  of  the  nonlinear  detectors  typical  of  the  LORD 
approach  (Section  1.3),  provided  and  are  available  after  two  orderings  of 
the  data  (the  denominator  is  just  the  number  of  normalized  samples  in  [-/!:,;(;]). 


With  this  background  ,  it  seems  natural  to  preserve  the  structure  of  the  SSQME 
test,  but  with  scale  invariant  location  estimators  and  an  adaptive  threshold  which  is 
derived  from  the  scale  estimate.  Thus  we  propose  the  following  test: 

DECIDE  iff  R{x,y)  >  t„W{x,y)  (5.16) 

where 


^(x,y)  J  [r„W(x)]2+[r„(^)(y)]2  ,  VE(x,y)  ^  [S'„(°)(x)]2+[5„(‘’)(y)]2  (5.17) 
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is  the  one  step  M-estimator  of  Eq.(5.15)  and  is  the  MAD  scale  estimator.  The 
structure  of  the  test  is  shown  in  Fig.  5.2. 


Fig.  5.2  SSQME  test  with  MAD  preliminary  scale  estimate. 
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The  asymptotic  properties  of  the  test  are  summarized  in  the  following.  They  are 
valid  for  all  simultaneous  and  one  step  M-estimators  provided  the  symmetry  properties 
that  were  mentioned  before  hold. 

Proposition  5.2  a)  The  test  (5.16)  is  CFAR  for  any  fixed  density  /  GPq  with  unk¬ 
nown  (variable)  cr.  b)  With  variable  f  ,  the  changes  in  the  asymptotic  false-alarm  proba¬ 
bility  are  governed  only  by  the  normalized  variance  of  the  location  estimates  : 
F'(F,rW)  ^  F(F’,rW)/(5(°)(F))'  ,  but  not  by  the  variance  of  the  scale  estimate,  c) 
The  asymptotic  detection  probability  is  given  by  the  same  expression  as  for  a  fixed 
threshold  SSQME  test:  /?(i^)=a(i^  (  for  a  Rayleigh  signal),  where 

tiA  ^ 

SNR{F)  —  j(i)^  (b.l8) 

is  also  not  a  function  of  the  scale  estimator  variance. 

Proof  1)  Both  and  are  scale  equivariant:  etc.,  from 

which  a)  follows  immediately.  Also,  S'(°)(F’,^)  =  cr5(°)(F^_i),  hence  we  will  omit  the  scale 
subscript  on  F  for  convenience. 

00  00 

2)  The  probability  of  acceptance  oi  H i  is  given  hy  J  f  \y{'^  )dw  J  f  ji{r  )dr  ,  since  R  and 

0  t,w 

W  are  asymptotically  independent  when  the  symmetry  conditions  of  i)  above  are  met.  R 
is  one  sided  exponentially  distributed,  with 

Eo{R)  =  -|(5W(F))2F'(P,rW)  ,  E,{R)  =  Eo{R)[l+SNR]  (5.19) 

by  straightforward  calculation.  Note  that  the  definition  of  SNR  is  justified  (by  analogy 
to  the  classical  case)  as  V  [F  is  also  a  invariant  by  its  definition,  as  long  as  the 

scale  and  location  estimates  are  also  invariant.  Hence  the  input  noise  variance  affects 
SNR  only  through  5(°\P,^)=(7P(°)(Pi).  As  W  is  the  sum  of  the  squares  of  two  indepen¬ 
dent  identically  distributed  Gaussian  r.v.’s  N ,v“),  it  has  a  noncentral  distribution 
with  2  degrees  of  freedom  and  noncentrality  parameter  X=2a^,  c.f.  [13].  Explicitly, 
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(5.20) 


where  /q  is  the  modified  Bessel  function  of  zeroth  order.  Here  a^S^°\F)  and 
==-^  V’(F  under  Hq  and  under  H 

1)  ^  Sj-^\x+c  )=Sj-^\x).  The  desired  probabili¬ 
ties  are  thus  computed  from; 


Pi 


The  evaluation  of  this  integral  is  facilitated  by 


f  exp(-x)lQ(2'/^  )dx 


exp(?/) 


(5.22) 


which  can  be  derived  by  a  change  of  variables  from  the  normalization  of  the  Rician  den¬ 
sity.  The  result  is 


2a^t„ 


Pi  = 


l-)-2f„  V  / Ei  [R  ) 


-Ho  ,  H, 


(5.23) 


As  a‘^/Ei(R)  ^  n  and  u  /£■,•  (R  )  is  not  a  function  of  n  ,  in  order  to  get  a  non-zero  or  as 
n->co  we  must  have  jn  .  Thus  when  n the  terms  that  contain  V{F  ,S^^\F)) 
through  V  cancel  out  and  lim  Pq  =  exp{-t /V^[F  follows  and  proves  b).  The 

n  ~>oo  ^ 

proof  of  c)  is  similar  upon  substitution  of  £'i(R)  from  Eq.(5.19)  into  (5.23)  and  taking 
the  limit  as  n  — ►oo. 


Comments  and  interpretation 

l)Since  the  error  probabilities  are  not  functions  of  the  variance  of  the  scale  estimator, 
should  be  chosen  to  achieve  better  and  flatter  consistency  over  ,  and  not  accord¬ 
ing  to  its  variability.  2)  The  false  alarm  probability  with  the  preliminary  estimate  is 
parametrized  by  which  is  roughly  equal  to  the  variance  under  the  known 

scale  case,  if  S^^\F)=1  and  it  does  not  change  too  much  over  Pq.  Hence  we  have  a  simi- 
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lar  situation  as  in  Chapter  4,  and  the  upper  bound  on  the  false  alarm  probability  is 
approximately  satisfied  if  the  least  favorable  density  is  chosen.  While  it  does  not 
seem  possible  to  give  an  analytic  characterization  of  the  density  that  maximizes 
over  the  e-mixture  family  Pq  as  was  discussed  in  iii)  above,  if  the  noise  den¬ 
sity  is  restricted  to  a  narrower  family  (e.g.,  Gauss-Gauss  or  Gauss-Laplace  e-  mixtures), 
the  supremum  of  the  variance  can  be  found  (at  least  numerically)  and  adjusted  accord¬ 
ingly  to  obtain  an  upper  bound  on  the  a  over  the  restricted  family.  3)With  that  res¬ 
tricted  maximization,  an  approximate  upper  bound  on  the  detection  probability  will  also 
be  obtained  as  follows.  Denote  by  Pq^  the  restricted  noise  family.  Since 
,D  )=F~\3/4)/D  (for  the  MAD  estimator)  changes  over  Por,  its  arbitrary  normal¬ 
ization  D  can  be  used  as  an  optimization  parameter.  Let  V**{D)=  MaxV'{F  ,T^^\D  ) 

^Or 

and  0!Q=exp{-t  /  V**  [D  ))'>a{F  ,D  ).  Under  the  same  approximation  of  Section  4.3  for 
sufficiently  high  desired  probability  of  detection  (,5^0.5),  with  substitution  of  SNR  from 
Eq.  (5.18): 

AF.D)  =  „(f,z,)-/(i+s™(F))  s 

Hence  the  additional  detectability  loss  from  scale  estimation  relative  to  the  worst  case 
where  a  is  known  is  characterized  by  the  effective  SNR  loss 

HF.D)  =  V-iPJlfW 

^(‘oN  ) 

and  D  can  be  chosen  to  minimize  the  maximum  loss  over  Pq^  : 
^  **  —  ArgMm{  Max  L  {F  ,D  )}.  Hence,  a  {f3)  is  bounded  from  above  (below)  over  the 

^  f  Or 

family,  and  the  additional  loss  due  to  scale  estimation  is  low  if  L  {F** 

Figures  5.3a-5.3f  show  the  results  of  such  a  numerical  optimization  over  the  Gauss- 
Gauss  mixture  family,  vs.  the  r.m.s.  power  of  the  contamination  c  .  In  each  figure  the 
dashed  lines  are  of  V'{F  ,  and  the  continuous  lines  are  of  L  {F  ,D  ).  One  set  of 

graphs  in  each  figure  corresponds  to  the  optimization  with  respect  to  D  ,  while  the  other 
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was  obtained  where  D  was  taken  to  get  consistency  at  the  least-favorable  density  of  the  known  a 

k 


D*  = 


k — |-ln 
k 


4(l-e)/o(^) 


fo{k)  1 


1  +  2(l-e) 
4(1  -  6) 


k  4(l-e) 

foik) 


(5.26) 


<■ 


1 


k  4(l-e) 

The  figures  demonstrate  that  there  is  no  additional  loss  (above  that  incurred  from 
^(^Oj7o)/^^)  from  the  scale  estimation  for  e<0.2,  that  the  loss  is  negligible  (about  4%)  for 
6=0.3,  and  it  is  quite  small  (about  26%, corresponding  to  1  dB)  for  e=0.4.  Even  for  the  huge 
uncertainty  e=0.5  the  loss  is  reasonable  (50%)  if  c  <20.  For  e>0.5,  it  seems  that  L  [F  ,D  **) 
is  unbounded.  This  agrees  with  the  “breakdown”  point  of  the  joint  estimation,  as  evaluated  by 
Huber  [3,  pp.  141-146].  Also,  the  loss  difference  between  the  optimal  D  **  and  the  normalization 
D*  IS  quite  small.  As  D  **  >0.6745,  the  efficiency  at  the  nominal  Gaussian  is  somewhat 
sacrificed  to  obtain  better  performance  over  (compare  with  Table  5.1).  Similar  conclusions 
can  be  drawn  from  Figures  5.4a-5.4d  which  are  valid  for  the  Gauss-Laplace  mixture  family.  When 
the  nominal  p.d.f.  is  Gaussian,  the  test  of  Eq.(5.17)  can  be  simplified  by  utilizing  or-trimmed  esti¬ 
mators  in  place  of  the  one  step  location  estimator.  In  that  case,  it  turns  out  that  the  agreement 
with  the  performance  of  the  known  scale  is  even  closer,  see  [48]  for  the  numerical  data. 
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_■  a)  r=0.01 


.  b)  £=0.1 


Fig.  5.3  Normalized  variance  {F  (dashed  lines)  and  effective  SNR  loss 

L(F,D)  of  SSQME  test  with  a  preliminary  scale  estimate.  Gauss-  Gauss  mixture, 
c -=contamination  variance.  D*  -  scale  normalization  of  the  known-cj  least  favorable 
density,  D  ** •  optimal  scale  normalization,  a)  €=0.01  . 
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Fig.  5.3  (continued) 


'^.OC 


^0.00  >40-00  60.00  80.00 

C=5.0.  OF  CONTOMlNfif ION 


OQ 


20.00  >40.00  60.00  90.00 

C  =  S.O.  OF  CON fflM INfl r I  ON 


e)  c=0.4  0  c=0.5 


C>=S.O.  OF  CONiPMlNflT  ION  C  =  $.0.  OF  CON  [PM  I  NPT’I  ON 


-  77  - 


oa)E=0.1  „b)E=0.2 

O  o 


c)  c=0.3 


d)  £=0.4 


^.00  20.00 
C=5.0. 


HO. GO  60.00  60.00 

OF  CONTAMINATION 


Fig.  5.4  Normalized  variance  {F  ,T^^^)/V**  (dashed  lines)  and  effective  SNR  loss 
L  {FjD)  of  SSQME  test  with  a  preliminary  scale  estimate.  Gauss-  Laplace  mixture, 
c *'=contamination  variance.  D*  -  scale  normalization  of  the  known-cr  least  favorable 
density,  D  **  -  optimal  scale  normalization. 
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Pinally,  W6  note  tha>t  the  undcsired  result  where  the  false  alarm  probability  still 
depends  on  /  GPq  (unlike  the  SW-SSQME  version)  is  a  consequence  of  estimating  the 
scale  of  the  input  samples  and  utilizing  it  for  the  threshold;  instead  of  direct  estimation 
of  the  variance  of  the  location  estimator,  (Recall  that  in  the  classical  normal  case  the 
variance  of  the  sample  mean  estimator  differs  only  by  a  factor  of  n  from  the  variance  of 
the  input  samples).  This  might  be  resolved  with  additional  implementation  complexity, 
by  taking  the  adaptive  threshold  to  be  proportional  toF^(x)+F^(y),  where 


7^(x)  =  -L 

n 


1  A 


(5.27) 


and  the  IC  is  given  by  Eq.  (5.13).  Under  the  previously  stated  regularity  and  symmetry 
conditions  this  estimate  of  U(F,r(^))  is  consistent.  Thus  it  is  reasonable  to  assume  that 
V’[F  will  cancel  out  from  the  error  probabilities.  However,  the  error  probabilities 

with  this  estimate  are  analytically  intractable  even  when  n  — >oo  (see  also  [3],  Section 

6.8). 

Kc 

We  feel  that  this  additional  complication  is  probably  unnecessary,  as  is  evident 
from  Figs.  5. 3-5. 4.  Moreover,  Monte-Carlo  finite  sample  simulation  results  indicate  that 
when  the  test  of  Eq.(5.16)  is  implemented  with  the  breakpoints  corresponding  to  the 
density  both  error  probabilities  did  not  deviate  in  any  noticeable  way  from  the 
known  a  case  (i.e.,  when  (5.12)  is  computed  with  5(F)=1),  when  €<0.2  and  n  >48. 
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5-3  Unknown  Signal  Frequency  and  Implementation  Complexity 

In  radar-sonar  detection,  the  frequency  of  the  signal  is  often  unknown,  since  it  is 
proportional  to  the  target  radial  velocity  by  the  Doppler  shift  effect.  In  the  Gaussian 
noise  case,  the  problem  is  commonly  solved  by  employing  a  bank  of  M  contiguous 
coherent  envelope  tests  ,  each  of  the  form  of  Eq.(2.4),  where  the  is  matched  (by 
baseband  conversion)  to  a  Doppler  shift  of  ifrIM  (/,.  =  1/7*  =pulse  repetition  fre¬ 
quency).  If  the  Doppler  shift  is  spread  uniformly  over  [0,/^],  almost  optimal  performance 
over  this  range  is  obtained  by  implementing  the  contiguous  bank  with  a  weighted  FFT 
processor  of  length  n ,  as  the  envelope  of  the  FFT  filters  outputs  is  almost  fiat.^  This  will 
serve  in  the  following  as  a  basis  for  comparison  of  implementation  complexity  of  the 
various  robust  tests. 

In  a  similar  manner,  robust  tests  for  unknown  frequency  can  be  constructed  as  a 
bank  of  M  contiguous  SSQME  tests.  Specifically,  in  the  channel,  A:  =0,1,...M-1,  the 
input  observables  of  Eqs.(2.5-2.6)  are  transformed  according  to 

li  {k  )=Re  [{xi+jyi  )exp{-j  2mk  /M)]  ,  {k  )=Im  [(a:,-  4-;V,-  )exp(-y  27rik  /M)] 

and  then  M  parallel  SSQME  tests  are  performed  on  {I(A:  ),Q(/: )},  see  Fig.  A.  In  this  way, 
for  the  M  discrete  frequencies  kf  ^/M  in  the  uncertainty  range  [0,/,.],  the  performance 
will  be  optimal  as  before,  with  some  loss  for  straddling  frequencies.  The  question  now 
arises  if  this  loss  is  comparable  to  that  of  the  FFT  processor  when  M=n  .  At  first  glance 
we  might  suspect  that  it  would  be  larger  due  to  the  nonlinear  processing,  but  it  turns  to 
be  identical  at  least  for  large  sample  sizes. 

Proposition  5.3  Let  the  frequency  shift  of  the  signal  be  /^=A/,./n  (  A  measures 
the  frequency  deviation  from  the  nearest  filter,  normalized  by  the  filter  bandwidth)  and 
let  n  — >-00.  Then,  the  detection  probability  of  the  various  SSQME  tests  is  given  by  the 
same  expressions  as  for  A  =  0,  where  the  effective  SNR  is  attenuated  by  G  (A)  which  is 


^Flatness  of  about  1  dB  is  achieved  with  Hamming  weighting. 
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identical  to  that  of  the  FFT  processor: 


sin(;rA) 

;rA 


(5,28) 


G(A)  = 

The  proposition  is  proved  in  appendix  D;  the  key  steps  are  showing  asymptotic  nor¬ 
mality  for  M-estimators  of  a  sequence  of  r.v/s  with  unequal  means,  and  that  the  I  and  Q 
estimators  are  asymptotically  jointly  Gaussian  and  independent,  even  though  /,♦  is  in 
general  not  independent  of  Q,*.  The  implications  should  be  clear:  not  only  that  it  is  not 
necessary  to  construct  a  bank  of  more  than  n  contiguous  SSQME  tests,  but  it  is  also 
possible  to  apply  the  same  weighting  techniques  that  are  employed  with  the  FFT  proces¬ 
sor  to  get  a  flatter  performance  over  the  frequency  range  (at  a  price  of  reduced  resolu¬ 
tion  and  reduced  SNR  at  /^=0).  (See  appendix  D  for  the  details.) 

It  was  not  possible  to  prove  a  similar  result  for  the  test  implemented  with  the  a- 
trimmed  means.  One  of  the  difficulties  is  that,  apparently,  there  is  not  any  general  result 
in  the  statistical  literature  on  the  distribution  of  L-statistics  when  the  samples  are  not 
identically  distributed.  However,  the  simulation  results  of  Chap  5.4  indicate  that  the 
above  proposition  is  essentially  valid.  To  support  it.  Fig.  B  depicts  the  frequency 
response  G^a('^);  when  no  noise  is  present;  this  can  be  viewed  as  an  approximation  for 
SNR—^oo.  (?a(A)  was  numerically  computed  as  the  average  (with  respect  to  the  uniform 
phase)  of  the  test  statistic 


GJA)  =  E^{I,  2(x,  y;  A)  +  Q,  ,y;  0,  A)} 


noise  =  0 

with  n  =16  (the  dependency  on  n  was  negligible  above  it).  It  is  seen  that  the  SNR 
attenuation  decreases  with  more  trimming,  and  is  always  less  than  that  of  the  linear 
quadrature  detector.  From  the  simulations  results,  the  presence  of  noise  seems  to  com¬ 
pensate  for  this  superior  behavior. 

Implementation  complexity 

The  major  increase  in  complexity  stems  from  the  nonlinear  processing  of  the 
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SSQME  test.  Thus,  the  number  of  operations  required  for  a  single  frequency  must  be 
multiplied  by  n  resulting  in  O(n^)  for  the  full  range,  compared  to  nlogn  for  the  linear 
FFT  processor  (in  the  following,  logn=log2n).  It  does  not  seem  possible  to  perform  some 
of  the  nonlinear  operations  before  the  n  frequency  conversions. 

For  purposes  of  simplicity,  we  assume  that  multiplications,  additions  and  comparis¬ 
ons  are  equally  costly.  Hence,  the  FFT  processor  requires  O  (5n  logn -{-4n )  real  operations. 
The  SSQME  test  for  the  nominally  Gaussian  noise  is  based  on  a-trimmed  means.  If  ord¬ 
ering  the  data  is  done  by  the  QUICKSORT  algorithm  [39],  the  expected  number  of 
operations  for  an  i.i.d.  sequence  is  0(n  logn)  ^  Hence  a-  trimming  requires 
O  (n  logn -h  c  (a),l<c  (a)<n.  With  it  available,  the  MAD  scale  estimator  requires 
O  {nlogn +2n)  operations  in  addition,  and  the  1-step  estimator  for  the  non-Gaussian 
nominal  density  needs  O(lln)  more,  assuming  that  the  evaluation  of  a  nonlinear  func¬ 
tion  is  done  by  interpolation  between  two  stored  values.  With  these,  the  order  of  com¬ 
plexity  for  the  various  tests  of  the  previous  section  is  easily  obtained  by  counting.  For 
the  SW  version  we  do  not  count  the  operations  in  the  adjacent  M  reference  cells,  as  the 
SW  is  performed  by  sequentially  repeating  the  algorithm  on  adjacent  spatial  cells  and  it 
is  only  required  to  keep  in  memory  the  outcomes  of  operations  that  have  already  been 
performed.  Table  5.4  summarizes  the  order  of  the  number  of  operations.  The  last  entry 
belongs  to  the  narrow-band  Wilcoxon  nonparametric  test  [40]  which  is  a  candidate  for 
comparison  with  the  robust  tests.  The  test  statistic  for  a  single  frequency  is 

RNBw{^,y)  =  (5.29) 

1=1  1=1 

where  is  the  rank  of  |  a:,-  |  in  |  |  ,  |  2^2  |  \  \  }  U{xi)  is  the  unity  step 

function.  The  test  has  the  same  order  of  complexity  as  the  SSQME  test,  which  substan¬ 
tially  outperforms  it  as  will  be  shown  in  the  next  section. 


^  QUICKSORT  is  a  random  algorithm,  it  can  require  up  toO  (n^)  operations. 
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Type  of  Test 

Order  of  Complexity 

FFT  processor 

5n  logn  +  4n 

SW-SSQME  for  nominal  Gaussian  noise 

2n^logn  4- 2n  (c  (a)-f  l) 

SSQME  with  preliminary  scale  estimate 
for  nominal  Gaussian  noise 

47i^(logn  -h  1)  -h  2n  (c  (a)-l-2) 

SW-SSQME  for  nominal  non-Gaussian 
noise 

4n^(logn  -h  12)  -f  8n 

SSQME  with  preliminary  scale  estimate 
for  nominal  non-Gaussian  noise 

4n^(logn  -1-  12)  -h  lln 

Wilcoxon  NB 

2n^(logn  -h  3)  -h  4n 

Table  5.4a  Order  of  complexity  for  various  tests  of  narrow-band  input  of  length  n. 

As  an  example  for  the  order  of  the  actual  required  processing  rates,  consider  the  fol¬ 
lowing  radar  case.  With  a  SW  test  and  a=0.25,  the  number  of  operations  per  frequency 
channel  and  range  bin  is  Oj  (n  )=n  (21ogn -h7)+4.  The  total  number  for  the  entire  range 
domain  is  Of  (n)r/r,  where  r  is  the  width  of  the  range  bin  and  7=1//,.  is  the  pulse 
repetition  interval.  The  processing  rate  per  frequency  channel  is  the  above  divided  by 
the  dwell  time  at  each  spatial  direction,  nT .  Thus,  it  equals  Of  (n)/(nr).  Table  5.4b 
compares  the  processing  rates,  in  million  operations  per  second  (MOPS),  for  r  =ifis 
(corresponding  to  range  resolution  of  150  meters),  for  the  robust  SW  and  FFT  detectors. 
The  robust  detector  requires  a  factor  of  10  to  40  more  than  that  required  by  the  FFT 
processor,  and  the  actual  rates  are  higher  than  possible  with  current  technology.  How¬ 
ever,  they  are  within  the  projected  capability  of  VLSI  and  VHSIC  technologies  for  the 
near  future.  Moreover,  the  “bank  of  Doppler  test”  structure  is  convenient  for  parallel 
implementation,  thus  only  the  numbers  at  the  left  part  of  the  table  need  be  considered. 


MOPS 

per  channel 

total 

MOPS 

n 

robust 

FFT 

robust 

FFT 

16 

15.2 

1.50 

244 

24 

32 

17.1 

.91 

548 

29.1 

64 

19.1 

.53 

1220 

33.9 

Table  5.4b  Processing  rates  for  a  radar  example. 


Figure  A  Bank  of  n  SSQME  tests  for  unknown  Doppler  frequency. 
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Figure  B  Frequency  response  of  the  SW  test  with  a-trimmed  estimators. 
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5.4  Finite  Sample  Simulation  Results 

The  asymptotic  approach  to  robust  statistical  procedures  is  an  unavoidable  neces¬ 
sity,  as  the  n  assumption  allows  one  to  invoke  the  central  limit  theorem  in  order  to 
obtain  explicit  functionals  which  parametrize  the  performance  of  the  procedure.  Thus, 
optimization  by  some  criterion  becomes  possible.  Although  asymptotically  one  could 
find  true  optimal  procedures  by  estimating  the  underlying  distribution,  this  usually 
requires  a  prohibitive  number  of  samples  [49].  Thus,  the  central  reason  for  the  asymp¬ 
totic  formulation  is  the  hope  that  robust  estimation  and  testing  procedures  will  approach 
their  asymptotic  behavior  quickly.  The  “Princeton  Monte  Carlo  Study”  of  Andrews  et 
al.  [35]  showed  that  this  is  indeed  the  case  for  the  variance  of  M  and  other  robust  esti¬ 
mators  of  location.  For  detection,  we  need  also  to  demonstrate  the  tail  behavior. 

In  most  communication  engineering  applications,  n  will  be  quite  small.  This  is  true 
particularly  for  radar  -  sonar  systems,  where  such  considerations  as  the  desired  non 
-ambiguous  detection  range,  angular  resolution  and  minimization  of  the  required  time  to 
search  the  detection  spatial  sector,  call  for  decreasing  n  .  Moreover,  we  have  seen  time 
and  again  through  this  work  that  detectability  of  coherent  signals  is  governed  (at  least 
asymptotically)  by  the  integrated  SNR  nA^.  In  active  systems,  this  product  is  propor¬ 
tional  to  the  average  transmitter  power,  which  usually  is  the  fundamental  constraint 
rather  than  the  peak  power  A  Therefore,  at  a  given  time  period  the  same  detectability 
can  be  maintained  by  increasing  A  and  reducing  n  while  keeping  constant,  thus 
achieving  the  other  important  systems  objectives.  As  a  result  of  that,  most  radar  sys¬ 
tems  are  designed  with  n=0  (10-50)  or  even  smaller  for  long  range  systems.  Conse¬ 
quently,  the  usefulness  of  procedures  resulting  from  asymptotic  analysis  remains  ques¬ 
tionable  without  small  -  sample  performance  analysis. 

Unfortunately,  analytic  tools  for  studying  finite  sample  performance  are  not  avail¬ 
able.  Results  in  the  spirit  of  the  Berry-Essen  theorem  [37],  which  bounds  the  deviation 
from  Gaussianity  of  the  distribution  of  the  sample  mean  estimator  are  non-existent  for 
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M-estimators.  This  is  so  because  it  is  not  possible  to  obtain  an  analytic  expression  for 
the  finite  sample  moments.  (Notice  that  its  asymptotic  variance  was  derived  indirectly 
through  the  convergence  of  its  distribution).  We  must  resort  to  either  numerical  or 
simulation  studies  which  do  not  analytically  exhibit  the  rate  of  convergence. 

For  M-estimators  with  a  monotone  nonlinearity  I,  the  distribution  can  be  com¬ 
puted  from  an  n-fold  numerical  convolution  since  Prob  {A  <a}= 
Prob  {Y^l{xj-a)<0}.  For  our  SSQME  test  more  integrations  are  required  since 
R  =A  ^  j  and  A  and  (j)  have  to  be  averaged  out.  Field  and  Hampell  [42]  recently 
showed  how  to  alleviate  the  enormous  amount  of  computations  by  their  so  called  small 
sample  asymptotics”  technique,  which  is  closely  related  to  Daniel’s  [43]  saddlepoint 
approximation.  Though  this  technique  was  shown  to  be  very  accurate  down  to  sample 
sizes  of  3,  even  in  the  extreme  tails,  it  is  still  numerical  in  nature  and  involves  a  substan¬ 
tial  programming  effort.  Therefore  we  have  not  proceeded  in  this  direction  but  per¬ 
formed  instead  a  Monte-Carlo  study  -  programming  is  relatively  straightforward  and 
flexible,  and  the  empirical  distribution  estimates  are  known  to  be  unbiased  and  converge 
to  the  true  distribution  with  a  large  number  of  repetitions, 

Heuristically,  the  SSQME  tests  based  on  robust  amplitude  estimation  should  con¬ 
verge  at  a  faster  rate  than  the  locally  optimal  robust  detector  (Section  4,4)  based  on 
weak  signal  assumptions.  The  latter  incorporates  a  hard  “non  tracking”  nonlinearity; 
the  distribution  of  the  test  statistic  has  finite  support  (when  n  is  finite)  with  a  point 
mass  at  the  upper  end  of  the  interval.  In  contrast,  the  M-estimator  can  take  any  value 
on  and  its  distribution  will  be  smoother  and  hence  converge  faster  to  Gaussian.  This 
is  most  significant  under  Hi  when  the  distribution  of  the  LORD  test  statistic  will  be 
mostly  concentrated  around  the  boundary  of  the  intervals  for  large  SNR. 

When  adopting  the  asymptotic  theory  testing  structure  for  finite  sample  size,  the 
amount  of  trimming  or  limiting  must  be  increased  if  very  small  P is  desired,  since  we 
are  dealing  with  the  tail  of  the  distribution  where  deviations  from  theory  are  larger.  This 
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is  explained  more  precisely  as  follows.  Let  Af.  represent  the  event  that  max  {  number  of 
outliers  in  the  I  observations,  number  of  outliers  in  the  Q  observations  }  =  A:,Since  these 
events  are  mutually  exclusive  and  their  union  is  the  certain  event  we  can  write 


(5.30) 

Pj,=Pr{R>t  }='e  Fr  {R  >t  \  A^jP  {Ak)+  E  Pr  {R  >t  \  A,,  }P  {A^) 

k  =0  k  =2[n  a|+l 

Thus,  a  lower  bound  for  is  the  second  term.  In  it,  the  number  of  outliers  exceeds' 
the  trimming  capability  of  either  estimator;  hence,  if  t  is  fixed  according  to  asymptotic 
theory  Pr{R'>t  \Af^}  will  be  large  and  close  to  1,  and  approximately 
Pfa=  Zl  P{^k)'  Assuming  that  the  contaminated  samples  come  from  a  switching 

k  >2[n  a]+l 

model,  the  probability  of  having  k  outliers  out  of  n  in  either  channel  is  binomial; 


Pk 


n' 

k 


,  and  due  to  independence  of  the  quadrature  samples 


P(yl,)  =  n'  +  2P/E^/  (5.31) 

The  following  table  exhibits  this  approximation  for  n—16  and  e=0,l.  With  the  asymp¬ 
totically  optimal  a=0.164,  Pf^  can  not  be  smaller  than  roughly  10“^,  while  a=0.27  is 
sufficient  for  10’®;  simulation  results  have  validated  this  approximation. 


a 

.125 

.1875 

.25 

.3125 

.375 

k„,  =2\n  Qfl+l 

5 

7 

9 

11 

13 

E  Pi^k) 

3.37- 10"2 

1.01-10"^ 

1.18-10-® 

5.41-10-* 

8.3610-^^ 

Several  versions  of  scale-invariant  detectors  of  Section  5.2  were  studied,  for  the 
nominally  Gaussian  case.  They  will  be  denoted  Dl  -  D5  as  follows: 

(Dl)  SW-SSQME  test  of  Eq.(5,5)  based  on  a-  trimmed  estimators  with  the  number  of 
reference  cells  M  =4. 

(D2)  like  Dl  but  M— >oo  (i.e.,the  estimation  of  the  variance  of  the  test  statistic  con- 


-  88  - 


verges  to  the  true  value).  For  that,  the  reference  statistics  fF(u,v)  of  the  Dl  simu¬ 
lation  was  accumulated  over  all  runs  to  produce  the  adaptive  threshold.  Thus,  a 
huge  saving  was  possible. 

(D3)  SW-SSQME  test  with  a  “one-step”  location  estimators  of  Eq.(5.15)  based  on  the 
MAD  scale  estimate;  its  normalization  factor  D=0.632  was  adjusted  to  get  (empiri¬ 
cally)  consistency  of  the  scale  estimator  ES  (x)=l  at  the  Gaussian  (this  value  is 
somewhat  lower  than  predicted  from  the  asymptotic  theory  ).  The  nonlinearity  was 
the  soft  limiter  I  [x  -k  ,k  ). 

(D4)  IS-SSQME  test  of  Eqs.  (5.16)-(5,17)  -  no  reference  cells  -  based  on  the  same  location 
and  scale  estimators  of  D3  . 

(D5)  SW  version  of  the  LORD  from  Section  4.4  (obtained  from  weak  signal  assumptions 
by  replacing  the  M  -  estimators  with  the  nonlinearity-integrator  of  Eq.  (4.29)), 

For  D1-D4,  both  a  and  k  were  adjusted  to  obtain  better  false-alarm  control  as  discussed 
before;  for  D5  we  chose  k  as  the  optimal  for  the  given  e  from  the  asymptotic  theory. 

For  each  test,  two  simulation  programs  were  built.  The  first  is  for  Hq  and  its  out¬ 
put  are  graphs  of  the  false  alarm  probability  vs.  the  normalized  threshold,  as  well  as  the 
first  two  moments  of  the  location  and  scale  estimators.  The  second  program  is  for  Hi, 
It  simulates  the  detection  probability  as  a  function  of  the  SNR  for  several  levels  of  the 
false  alarm  probability  and  for  a  Swerling  I  target  model,  i.e,,  coherent  narrowband  sig¬ 
nal  with  Rayleigh  distributed  amplitude  and  uniform  phase  (see  Section  4.3,  case  a).  In 
each  of  the  simulations,  the  test  was  subjected  to  four  different  noise  p.d.f.’s  from  an  e- 
mixture  family  where  the  nominal  is  normal  Gaussian: 

(/i)  ^=0 

(/  2)  two  point  masses  at  ±15 

(/  3)  0‘1  ^nd  a  Gaussian  contamination  with  variance  c^=100 


-  89  - 


(/  4)  ^=0,01  and  a  Gaussian  contamination  with  c^=900 

In  the  sequel,  we  shall  use  these  numbers  to  identify  the  situation.  The  input  variances 
are  thus  1,  23.4,  10.9  and  9.99  respectively  -  all  e^O  cases  represent  large,  heavy  tailed 
contaminations  and  would  have  resulted  in  a  substantial  degradation  of  performance  if 
the  unrobustified  envelope  detector  was  employed  (see  Section  2.2).  The  e-  mixture  r.v.’s 
were  generated  from  a  switching  device:  at  each  step,  a  normal  r.v  t/j  was  generated 
together  with  a  binomial  r.v  5,-  that  is  equal  to  1  with  probability  e;  =t/,-  if 
and  x^=  c  (or  x^=±15  with  probability  0.5  for  f  2)  otherwise.  Almost  all  simula¬ 
tions  were  for  n  =16. 

Table  5.5  displays  the  Monte-  Carlo  variances  of  the  a-  trimmed  estimator  for  vari¬ 
ous  Qf’s.  For  comparison,  from  the  asymptotic  theory  at  the  least-favorable  p.d.f. 
F^(€=0.1)  =1.49,  and  for  a=0.225  we  computed  from  Eq.(5.3) 
F(l)=1.17, F(2)=1.51,  K(3)=1.46,  V^(4)=1.2.  This  good  correspondence  between 
asymptotic  theory  and  finite  sample  simulation  (for  n=20)  was  observed  before  by 
Andrews  et.  al.  [35].  The  asymptotically  optimal  trimming  ratio  is  a(€=0.1)=  0.164;  as 
was  discussed  before  it  is  necessary  to  apply  more  trimming  with  small  sample  size  when 
very  small  false  alarm  probabilities  are  desired.  The  simulations  indicated  that 
0^=0. 225  to  0.3  is  sufficient;  the  table  shows  that  except  for  /  ^  for  this  a  range  the  vari¬ 
ances  are  roughly  as  those  for  a  =  0.164  (or  even  smaller).  The  results  suggest  that  the 
detectability  loss  compared  to  theory  is  small.  This  has  been  verified  from  the  simu¬ 
lation  results,  which  will  be  presented  below.  Similar  conclusions  can  be  drawn  from 
Table  5.6,  which  is  for  the  “one-step”  estimator;  here  the  asymptotically  optimal  limit¬ 
ing  is  A:  (€=0.1)  =  1.14,  and  very  small  losses  (  if  any  )  are  incurred  by  taking  k  =0.75. 
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Of 

V{f,) 

V[f,) 

V{f,) 

V{f,) 

0.164 

1.09 

1.43 

1.60 

1.13 

0.225 

1.15 

1.34 

1.53 

1.18 

0.25 

1.17 

1.35 

1.51 

1.20 

0.3 

1.21 

1.41 

1.55 

1.25 

0.4 

1.36 

1.56 

1.69 

1.39 

0.5 

1.47 

1.68 

1.81 

1.50 

Table  5.5  Monte-Carlo  variances  of  the  a-  trimmed  estimator.  n=16,  20,000  runs. 


k 

Vif A 

V(f,) 

V{f,) 

Vif^) 

1.14 

1.08 

1.29 

1.53 

1.12 

0.9 

1.13 

1.31 

1.51 

1.16 

0.75 

1.16 

1.34 

1.50 

1.24 

0.5 

1.29 

1.46 

1.62 

1.32 

0.3 

1.41 

1.60 

1.74 

1.42 

Table  5.6  Monte-Carlo  variances  of  the  “one  -  step”  estimator  with  MAD  scale  estima¬ 
tor.  n=16,  20,000  runs. 

Probability  of  false  alarm  curves  are  shown  in  Figures  5.5  -  5.16.  In  all  these 
figures,  the  smooth  curves  marked  “T”  are  computed  from  the  asymptotic  theory  - 
Eqs.(5.6)  and  (5.23),  and  the  numbered  curves  are  the  simulation  results  for  the  densities 
/  i~  /  4*  Fig.  5.5  corresponds  to  Dl  and  5.6  to  D2,  with  a=0.165.  We  see  clearly  that 
here  a  is  too  large,  since  for  the  more  contaminated  cases  is  much  higher  than 

desired.  This  problem  is  solved  by  taking  a=0.225  as  can  be  seen  from  Figs.  5. 7-5. 8,  In 
the  first  one,  the  Monte-Carlo  curves  are  all  remarkably  close  to  the  theoretic  curve, 
within  the  range  where  the  simulation  results  are  reliable  ^4/n,.  =2T0"^.  The 

test  is  clearly  CFAR,  for  very  different  noise  p.d.f.^s. 

In  Figure  5.8  Py^  is  even  somewhat  smaller  than  the  theoretic,  (this  means  that  the  dis¬ 
tribution  of  the  test  statistic  R  has  shorter  tails  compared  to  the  one  sided  exponential). 

^  The  empirical  distribution  for  rij.  repetition  of  the  Monte-Carlo  experiment 
(a:  )=number{a:{  >  a:  }/n,. ,  is  a  binomial  r.v.  with  mean  =1-P(x)  and  variance 

=P  (x  )(l-P  (x  ))/n,. .  Therefore,  for  Py^^  «l,for  yJVar  (Pya)/Pya  ^0.5  one  obtains 
P/a  >4/n,.. 


-  91  - 


Thus,  if  the  threshold  is  fixed  according  to  the  asymptotic  theory,  Pf^  is  bounded  from 
above  as  desired.  The  same  is  valid  for  a=0.3  as  can  be  seen  from  Figs.  5.9-10.  The 
curves  for  D3  are  shown  in  Figs.  5.11-12;  in  the  first  one,  the  limiter  breakpoint  is 
^=1.14  according  to  the  asymptotic  theory  -  it  is  again  seen  that  the  limiting  is  not 
‘^hard”  enough,  as  Pf^  for  /  3  is  higher  than  the  theoretic  bound  when  Pf^  <0.01.  This 
problem  is  corrected  by  taking  A:  =0.75  as  can  be  seen  from  Fig,  5.12. 

Results  for  D4  are  displayed  in  Figs.  5.13-5.14;  notice  the  two  sets  of  curves  in  Fig. 
5.13:  the  first  is  of  \ogProb  {R  {x,y)>{t  ln)V*  lF(x,y)}  where  V*=IA9,  which  should 
approach  a  straight  line  when  n  —yoo  according  to  proposition  5.2.  In  the  second  case  the 
scale  is  contracted  by  taking  ^=3.  Here  asymptotic  theory  is  no  longer  valid:  the 
curves  are  not  linear  and  for  achieving  small  P/^’s  higher  threshold  settings  are 
required.  These  higher  settings,  in  turn,  would  cause  higher  detectability  losses.  This 
deviation  from  asymptotic  theory  is  not  surprising,  and  is  equivalent  to  the  behavior  of 
the  ‘‘Quadrature  t-test,”  which  is  the  maximum  likelihood  test  for  a  coherent  nar¬ 
rowband  signal  in  Gaussian  noise  of  unknown  variance 

H 

(i  E  “i )"  +  (^  S  f  i  lS(*i  -  +  £(k  -  I  (5.32) 

71  71  II Q  71 

For  small  n,  the  right  hand  side  cannot  be  approximated  by  a  Gaussian  distribution  (it 
is  chi-squared  for  all  n  ),  and  also  the  dependency  between  the  right  and  left  sides  of  Eq. 
(5.29)  cannot  be  neglected  as  we  did  in  proving  Proposition  5.2.  In  Fig.  5.14  n  was 
increased  to  48,  in  order  to  check  convergence.  It  is  evident  that  the  curves  almost  con¬ 
verge  to  asymptotic  theory,  although  some  curvature  is  still  noticed. 

Finally,  Fig.  5.15  is  for  the  LORD  test  D5.  Here,  as  in  Fig.  5.8,  a  tendency  for 
shorter  tails  is  observed.  This  could  be  heuristically  described  by  the  hard  limiting  that 
restricts  the  distribution  of  the  test  statistic  to  have  a  finite  support. 

Dramatic  improvement  in  false  alarms  control  over  the  unrobustified  SW  detector 
Eq.  (2.8)  is  evident  when  comparing  the  previous  figures  with  Fig.  5.16.  When  it  operates 
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in  a  Gauss-Gauss  (€=0.1,  c  =10  )  noise,  is  intolerably  increased  from  the  design 
lO”"*  to  about  0.1  when  M=4,  and  from  10“®  to  2-10“^  when  M  — ►oo.  Compare  also  Fig. 
2.5  for  €=0.01.  Thus  we  can  confidently  conclude  that  the  various  SW-SSQME  tests 
essentially  maintain  CFAR  according  to  the  asymptotic  theory  of  Section  5.2  even  for  n 
as  small  as  16  with  Pf^  down  to  (at  least)  10'^.  For  the  test  without  reference  samples 
D4,  n  is  not  much  larger  than  48.  Based  on  these  curves,  the  detection  thresholds  for 
the  Hi  simulation  were  simply  taken  from  the  smooth  theoretical  curves,  and  extrapo¬ 
lated  from  them  for  F^a=10”^.  Future  work  could  utilize  reduced  variance  sampling 
techniques  such  as  “importance  sampling  ”[44]  to  validate  this  extrapolation. 

Probability  of  detection  curves  for  Dl,  D2  and  D5  are  shown  in  Figs.  5.17-27.  Here 
5000  repetitions  are  sufficient  to  get  Var^^\P^)/P^  <0A  for  .02<P^<0.98.  The 
abscissa  is  the  effective  integrated  SNR  =nE  {A")/a^  where  a"  is  the  variance  of  the 
nominal  Gaussian  p.d.f.  The  known  signal  frequency  case  is  depicted  in  Figs.  5.17-20. 
The  Monte-Carlo  curves  are  clearly  bounded  from  the  left  by  the  computed  for  the 
unrobustified  SW  detector  when  €=0,  and  from  the  right  by  the  maximin  lower  bound 
computed  from  asymptotic  theory,  Eq.  (5.6),  with  V*  of  the  least-favorable  p.d.f.  /  * , 
Here  again  the  fit  between  asymptotic  theory  and  small  sample  performance  is  striking: 
the  curves  for  /  i-/  4  are  ranked  exactly  according  to  the  estimation  variances  of  Table 
5.5,  and  the  simulated  P^  values  generally  differ  by  no  more  than  a  few  percent  when 
the  simulation  variances  are  substituted  in  Eq.  (5.6)  or  (5.23).  Equivalently,  the  horizon¬ 
tal  differences  between  curves  are  roughly  the  dB  values  of  the  variances  from  Table  5.5. 
We  also  note  that  the  difference  between  a=0.225  and  a=0.3  is  very  small,  again 
according  to  the  variance  difference. 

In  contrast  with  the  above  results,  the  unrobustified  SW  detector  of  Eq.(2.8)  was 
found  to  suffer  substantial  detectability  losses.  They  were  almost  identical  to  the 
increased  input  variances  of  /  2  ”  /  4?  with  respect  to  /  ;j^. 
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Some  non-parametric  schemes  were  developed  for  detection  of  narrowband  signals 
in  unknown  noise;  the  narrowband  Wilcoxon  detector,  Eq,(5.29),  was  proposed  by  Car¬ 
lyle  and  its  Monte-Carlo  performance  in  purely  Gaussian  noise  was  studied  by  Hansen 
[40],  It  was  found  that  the  SNR  losses  compared  to  the  UMP  detector  for  n=16  and 
Pfa=lO^^  are  5  and  14  dB,  for  P^=0.5  and  0.9,  respectively  (smaller  losses  are  incurred 
for  larger  n  )  -  compared  to  merely  0.6  db  for  D2  in  Fig.  5.18  !  Furthermore,  the  perfor¬ 
mance  in  non-Gaussian  noise  has  not  been  studied. 

The  performance  of  the  detector  without  reference  samples,  D4  with  n  =  16,  is 
shown  in  Fig.  5.20.  Although  convergence  to  the  asymptotic  theory  is  not  yet  reached 
(in  accordance  with  the  higher  required  threshold  values  -  recall  Fig.  5.13),  the  detector 
is  clearly  robust.  This  is  evident  from  the  small  differences  between  the  curves  for 
/  1  -  /  4.  By  comparison  with  Fig.  5.18,  the  SNR  losses  are  roughly  1.8  and  5.8  dB  for 

=  10'^  and  10^^,  respectively,  independent  of  the  P^.  However,  when  comparing  it 
with  the  SW-SSQME  test  Dl  of  only  4  reference  channels,  D4  outperforms  it  by  roughly 
1  and  3.4  dB,  for  =  10'^  and  10"®,  respectively.  By  virtue  of  the  shift  invariance  of 
the  one-step  location  estimator,  if  convergence  to  Gaussianity  is  reached  under  the 
performance  under  Hi  will  agree  with  the  asymptotic  prediction.  Thus,  when  n  >  50 
D4  would  hardly  suffer  any  detectability  losses  compared  to  the  UMP  detector  for  the 
least  favorable  noise  p.d.f. 

In  contrast  to  the  excellent  performance  of  the  structures  proposed  in  this  work, 
Fig.  5.21  shows  that  while  the  LORD  test  D5  performs  roughly  the  same  for  Pya==10"^, 
it  is  inf  ertor  for  Py^=10"®;  its  SNR  should  be  higher  by  4.6  dB  to  achieve  P^=0.9  at 
/  3.  This  can  be  simply  explained:  when  the  SNR  is  sufficiently  high  for  high  P^ ,  the 
“non-tracking”  hard  limiter  of  Eq.  (4.29)  prevents  the  distribution  of  R  from  tilting 
towards  higher  values.  Also,  asymptotic  theory  does  not  properly  describe  its  behavior 
at  the  most  interesting  zone  P^— ^1.  (Recall  that  SNR  — >0  was  necessary  for  the  deriva¬ 
tion  of  its  asymptotic  distribution,  but  not  for  the  tests  which  are  based  on  true  robust 
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estimation  of  the  arbitrary  amplitude).  Notice  also  that  the  LORD  test  D5  is  not  scale 
invariant,  and  the  simulation  results  for  it  are  valid  only  when  the  variance  of  the  nomi¬ 
nal  equal  unity. 

The  performance  with  unmatched  signal  frequency  when  a  bank  of  n  contiguous 
SVV-SSQME  tests  is  utilized  for  unknown  frequency  (Section  5.3  ),  are  depicted  in  Figs. 
5.22-24  where  the  normalized  deviation  A/  An  Af  ^  /  f  ^  =0.25,  and  in  Figs.  5.25-27  for 
A/  =0.5  (halfway  between  the  central  frequency  of  adjacent  tests  ).  The  mismatch  loss 
is  almost  indistinguishable  from  that  of  Proposition  5.3  -  i.e.,  the  same  loss  incurred  by 
the  FFT  processor  for  a  frequency  unmatched  signal  in  pure  Gaussian  noise.  While  this 
loss  is  reasonable  for  Dl  and  D2,  0.91  dB  and  3.9  dB  for  A/  =0.25  and  0.5  respectively, 
the  weak  signal  locally  -  optimal  test  D5  totally  breaks  down  under  this  situation,  as 
demonstrated  by  Figs.  5.24  and  5.27.  We  note  that  this  loss  for  and  D  2  can  be 
reduced  by  weighting  the  input  samples.  (See  appendix  D.) 

Another  interesting  conclusion  can  be  drawn.  In  many  practical  cases,  the  noise 
p.d.f.  is  variance  constrained,  as  it  is  proportional  to  the  total  (nominal  plus  contaminat¬ 
ing  components)  noise  power.  This  is  true  in  particular  for  the  clutter  (reverberation) 
environment,  where  the  noise  power  is  proportional  to  the  radar  (sonar)  transmitted 
power.  A  question  now  arises  as  to  which  variance  constrained  noise  p.d.f.  constitutes 
the  worst  detection  environment.  Consider  for  example  Fig.  5.18  and  /  3  for  which  the 
detector  D2  is  essentially  optimal.  However,  the  variance  of  /  3  is  10.9.  Consequently,  in 
a  Gaussian  noise  environment  of  equal  power  (t^=10.9,  the  UMP  detector  would  perform 
worse  by  10.9  -1.5  =  9.4  dB  than  the  detector  D2  !  This  turns  to  be  true  in  general.  In 
[45],  we  show  that  the  Gaussian  is  the  least  favorable  variance  constrained  p.d.f.  for 
detection  of  lowpass  as  well  as  narrowband  coherent  signals  with  random  parameters. 
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Fig.  5.5  False  alarm  probability.  Test  -  Dl  ,  a=.164. 
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Fig.  5.6  False  alarm  probability.  Test  -  D2  ,  a=.164. 
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Fig.  5.7  False  alarm  probability.  Test  -  Dl  ,  a=.225. 


Fig.  5.8  False  alarm  probability.  Test  -  D2  ,  a=.225. 
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Fig.  5.9  False  alarm  probability.  Test  -  Dl  ,  q;=.3. 


.10  False  alarm  probability.  Test  -  D2  ,  q;=.3. 
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Fig.  5.11  False  alarm  probability.  Test  -  D3  ,  K=1.14. 
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Fig.  5.12  False  alarm  probability.  Test  -  D3  ,  K=.75. 
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Fig.  5.13  False  alarm  probability.  Test  -  D4  ,  K=.85,  n=16. 
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Fig.  5.14  False  alarm  probability.  Test  -  D4  ,  K=.9,  a=48. 
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Fig.  5.15  False  alarm  probability.  Test  -  D5  ,  K=1.14. 
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Fig.  5.16  False  alarm  probability.  Unrobustified  envelope  detector,  Gauss-Gauss  mix¬ 
ture  (e=0.1,c  =10). 
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Fig.  5.17  Detection  probability.  Test  -  Dl,  q;=.225,A/  =0 
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Fig.  5.18  Detection  probability.  Test  -  D2,  a=.225,A/  =0 
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Fig.  5.19  Detection  probability.  Test  -  Dl,  a=.3,A/  =0 


Fig.  5.20  Detection  probability.  Test  -  D4,  n  =  16,  A/  =0 
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Fig.  5.21  Detection  probability.  Test  -  D5,  K  =  1.14,  A /  =0 
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Fig.  5.23  Detection  probability.  Test  -  D2,  a=.225,A/  =.25 
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Fig.  5.24  Detection  probability.  Test  -  D5,  iiC =1.14,A/  =.25 
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Fig.  5.25  Detection  probability.  Test  -  Dl,  a=.225,Af  =.5 
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Fig.  5.26  Detection  probability.  Test  -  D2,  a=.225,Af  =.5 
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Fig.  5.27  Detection  probability.  Test  -  D5,  /C=1.14,A/  =.5 
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6.  ROBUST  RANK  TESTS  FOR  NARROWBAND  FADING  SIGNALS 

6.1  Introduction 

6.1.1  -  Motivation 

On  initial  reflection,  it  seems  somewhat  contradictory  to  investigate  robust  rank 
tests.  Since  rank  tests  are  generally  distribution-free  (DF)  under  the  null  hypothesis,  it  is 
reasonable  to  ask  if  their  detection  performance  are  not  already  inherently  insensitive  to 
the  distribution  shape.  It  turns  out  that  this  is  indeed  the  case,  but  we  have  not  found 
any  previous  work  pointing  to  that  conclusion.  Moreover,  it  will  be  shown  that  by  a 
slight  increase  in  complexity,  we  can  also  have  maximin  optimal  performance.  Other  rea¬ 
sons  that  motivate  us  to  study  tests  based  on  rank  statistics  are  the  following. 

i)  It  is  possible  to  design  rank  tests  that  attain  the  DF-CFAR  property  for  all  n  , 
under  relatively  weak  assumptions.  In  particular,  some  of  the  tests  that  will  be  studied 
in  this  chapter  only  require  that  the  observations  be  i.i.d.  under  the  null  hypothesis.^  By 
way  of  contrast,  the  various  robust  SSQME  tests  of  Chapters  4-5  are  only  asymptoti¬ 
cally  CFAR.  Though  the  simulation  results.  Section  5.4,  showed  very  good  CFAR  pro¬ 
perty  for  n  =  16,  this  is  not  true  for  all  n  ,  for  all  desired  level,  and  for  all  possible 
noise  contaminations.  For  instance,  in  order  to  achieve  <  10“^  with  contaminations 
of  €  ^  0.5,  it  was  found  that  trimming  levels  of  a  — f  0.5  are  required,  i.e.,  the  detector  is 
based  on  median  estimators.  This  leads  to  large  variances  as  indicated  by  the  asymp¬ 
totic  formula,  Eq.  (5.3).  The  variances  are  even  larger  for  small  n  ;  hence,  is  further 
degraded. 

ii)  With  the  SW-SSQME  tests.  Section  5.2a,  the  adaptive  threshold  estimator, 
^(u,  v)  of  Eq.  (5.5),  does  not  efficiently  utilize  the  total  number  of  available  noise  refer¬ 
ence  samples,  nM .  Actually,  the  ensuing  detectability  loss  is  a  function  of  M  and  not  of 

^  This  is  probably  the  most  important  feature  of  target  detection  systems  (radar,  sonar), 
where  initiation  of  some  action  process  based  on  false-alarm  might  bear  disastrous  results. 
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nM  (see  Eq.  (5.6)),  and  is  quite  large  for  small  M .  In  a  non-homogeneous  environment 
(e.g.,  clutter  “edges”  and  multiple  targets,  cf.  [20]),  M  must  be  kept  small  in  order  not 
to  violate  the  assumption  that  the  reference  samples  are  identically  distributed  as  the 
test  cell  samples.^  On  the  other  hand,  the  two-sample  rank  tests  that  we  study  use  the 
reference  samples  more  efficiently.  Furthermore,  it  is  even  possible,  with  proper  choice 
of  the  modulating  signal,  to  obtain  the  optimal  performance  of  the  locally  most  powerful 
(LMP)  detector  when  the  noise  p.d.f,  is  known,  and  to  design  for  a  maximin  robust  rank 
detector  (in  terms  of  )  over  a  mixture  class. 

iii)  With  the  scale  invariant  IS-SSQME  tests  which  are  not  based  on  reference 
samples.  Section  5.2b,  it  was  not  possible  to  obtain  exact  CFAR  and  maximin  P^ 
though  they  were  numerically  shown  to  be  robust  over  wide  parametric  families.  More¬ 
over,  although  these  tests  suffer  a  rather  small  detectability  loss  at  the  least  favorable 
p.d.f.,  the  simulation  study  has  shown  a  slow  convergence  to  asymptotic  theory.  (Intui¬ 
tively,  the  reason  for  that  is  the  necessity  to  estimate  two  parameters  from  the  data  set). 

The  rank  tests  which  are  proposed  and  studied  in  this  chapter,  essentially  avoid  the 
above  mentioned  problems.  On  the  negative  side,  some  of  the  tests  necessitate  increased 
storage  and  computational  loads.  Also,  the  various  SSQME  tests  are  all  based  on  shift 
invariant  estimators,  and  hence  once  the  convergence  to  the  asymptotic  theory  has  been 
practically  reached  under  Hq,  the  detection  performance  under  Hi  follows  immediately 
from  the  asymptotic  results,  for  all  signal  levels.  This  is  clearly  exhibited  by  the  simula¬ 
tion  results.  Section  5.4.  By  way  of  contrast,  the  proposed  rank  statistics  are  not  shift 
invariant,  and  our  asymptotic  derivations  are  valid  for  weak  signals  in  the  nonsingular 
detection  sense:  na^  c  .  While  most  of  the  rank  statistics  that  we  consider  could  be 
replaced  by  shift  invariant  rank  estimators^  see  [3],  without  affecting  the  efficacy 
mance,  this  will  again  necessitate 


"  The  adaptive  threshold  W{u,  v)  could  be  robustified  against  interfering  signals  and  clutter  edges 
in  a  manner  similar  to  that  of  our  previous  work  [20],  and  similar  improvements  can  be  anticipated. 
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performance,  this  will  again  necessitate  robustly  estimating  their  variance.  Thus,  we  will 
at  most  obtain  the  performance  of  the  SSQME  tests,  but  with  increased  complexity. 

6.1.2  -  Preview 

In  Section  6.2,  the  available  observations  structure  for  typical  radar/sonar  applica¬ 
tions  is  presented,  and  some  rank  tests  are  classified  according  to  the  ranking  structure. 
One-sample  tests  do  not  require  noise-reference  observations,  while  two-sample  tests  are 
based  on  ranking  the  entire  data-set:  the  observations  obtained  from  the  test  cell  under 
question,  plus  these  which  originate  from  a  noise  reference.  In  single-sweep  tests,  each 
observation  (“sweep”)  from  the  test  cell,  is  ranked  with  respect  to  its  contemporary 
noise  reference  observations,  and  the  final  test  statistics  is  a  function  of  the  single  sweep 
ranks,  accumulated  over  the  illuminating  pulse  train  period.  The  various  rank  tests  thus 
represent  different  levels  of  performance  vs.  complexity.  Previous  relevant  results  on 
rank  statistics  are  also  summarized. 

In  Section  6.3,  various  asymptotically  optimal  tests  for  detection  of  fading  nar¬ 
rowband  bandpass  signals  are  derived,  when  the  noise  p.d.f.  is  known  only  in  shape,  i.e., 
any  shift  and  scale  transformations  do  not  affect  their  performance.  The  model  is  given 
by  the  p.d.f.  under 


n  (M+1) 


n 


f  (x,  y)  =  n  /  {Xi  -  aSi  A  COS0)  f  (t/,-  -  aSiAsixid)  H  /  (j,- )  /  (y,- )  (6.1.1) 

t  =1 


where  A  is  the  random  signal  amplitude,  normalized  such  that  represents  the  unk¬ 
nown  input  SNR.  {s,- }  is  a  known  signal  sequence,  and  6  is  the  random  phase.  The 
samples  are  gathered  such  that  the  test  cell  samples  are  in  the  first  n  places.  M 
represents  the  number  of  noise-reference  vectors,  so  there  are  nM  noise  alone  observa¬ 
tions.  The  X  and  y  samples  are  ranked  separately,  such  that  R^i  is  the  rank  of  a:,- 
among  {x,-  }f^i,  N  —  n  [M  -j-  1),  and  likewise  R^i  ,  (for  a  one-sample  problem, 
M  —  0).  The  class  of  tests  that  we  study  is  of  the  form 


(6.1.2) 


-  lic¬ 


it  is  shown  that,  for  arbitrary  distribution  of  A  ,  this  quadrature  structure  asymptoti¬ 
cally  maximizes  for  the  model  (6,1.1);  that  for  optimality  =  5,-;  and  there  exists 
an  optimal  score  function  ai^[i  J  ),  related  to  /  .  In  fact,  it  is  the  same  score  function 
which  was  found  by  Hajek  [61]  for  optimal  detection  of  a  lowpass  deterministic  signal. 
It  should  be  emphasized  that  the  class  of  tests  (6.1.2)  is  DF-CFAR  with  the  weak 
assumption  that  {xj ,  are  i.i.d.  under  No  symmetry  assumption  on  /  is  neces¬ 
sary,  and  our  test  does  not  utilize  the  sign  of  the  observations,  as  has  been  common  in 
previous  works,  cf.  [32],  [40]. 

In  communication  and  target  detection  systems,  the  known  part  of  the  signal 
sequence,  {5^* }  is  usually  at  the  control  of  the  system  designer.  The  most  interesting 
result  of  this  section  is  that  if  a  zero  mean  {5,-  }  is  chosen,  the  performance  of  an  optimal 
one-sample  test,  is  identical  with  that  of  the  parametric  LMP  for  completely  known  /  , 
and  no  reference  samples  are  needed.  Furthermore,  this  is  simply  achieved  with  phase 
shifting  by  tt  half  of  the  transmitted  pulses,  in  an  arbitrary  order.  The  optimal  statistics 
for  each  of  the  channels  in  (6.1.2)  then  take  the  form 

^(x)=  E  E  0.„{Rni)  (6-1-3) 

=0  <f>^  =7r 

and  likewise  for  the  y  observations.  For  single-sweep  ranking,  the  optimal  test  is  of  the 
form 

n  J  ^  n 

E  (M  -f  l)i )  +  E  “M  +  l(-R  fM+l)» ) 

i=l  J  [i  =  l 

where  here  is  the  rank  of  a:,-  in  {xi ,  x^+i,  X2n+i,  ■  ■  ■  ,  ^Mn+i}-  The 

large  saving  in  ranking  complexity  is  compromised  for  reduced  detection  performance, 
since  here  only  if  in  addition  M— ^oo  the  performance  of  the  optimal  one-sample  test  is 
achieved.  The  section  concludes  with  various  numerical  comparisons  of  the  detectors. 
In  particular,  the  superiority  over  the  SW-SSQME  test  (Chapter  5)  when  the  number  M 
of  reference  vectors  is  small,  is  demonstrated. 


>  ‘ 
< 
Ho 


(6.1.4) 
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Traditionally,  rank  tests  have  been  designed  to  achieve  a  DF-CFAR  property,  and 
then  the  detection  probability  was  analyzed  at  one  point  in  the  noise  distribution  space, 
usually  at  the  Gaussian,  cf.  [21],  [32],  [40],  [52-55].  In  Section  6.4,  we  study  the  detection 
performance  of  the  various  proposed  tests,  assuming  the  uncertainty  in  the  noise  p.d.f.  is 
modeled  by  a  mixture  family.  Common  score  functions  are  analyzed  -  median,  Wilcoxon 
and  normal  scores.  It  is  found  that  the  quadrature  rank  tests  with  these  score  functions 
are  inherently  robust  from  a  detectability  point  of  view.  In  fact,  in  the  worst  case, 
degrades  only  as  some  function  of  (1  -  e),  whereas  heavily  tailed  contaminations  hardly 
influence  the  performance.  This  is  in  sharp  contrast  with  the  parametric  UMP  scale 
invariant  detector  for  fading  narrowband  signal  in  Gaussian  noise.  Chapter  2.2,  where 

>  0  even  for  small  t  when  the  variance  of  the  contaminating  p.d.f.  is  very  large. 

In  Section  6.5,  maximin  robust  rank  tests  for  the  model  (6.1.1)  are  found,  when  the 
noise  p.d.f.  belongs  to  a  mixture  family.  The  structure  is  identical  with  that  of  the 
optimal  tests  of  Section  6.3,  but  now  the  score  function  ayv(2,/o)  is  derived  from  the 
least  favorable  distribution  /  q,  i.e.,  the  one  that  minimizes  Fisher’s  information.  In  a 
similar  manner  to  the  SSQME  tests,  a  soft  limiter  is  utilized,  but  here  it  operates  on  the 
nominal  score  function  of  the  ranks,  and  not  on  the  original  observations.  The  proof  of 
maximin  optimality  parallels  that  for  rank  estimators  [3],  as  the  performance  of  both  is 
proportional  to  the  same  functional.  As  usual,  the  efficiency  at  the  nominal  distribution 
is  sacrificed  somewhat  in  order  to  achieve  overall  good  behavior.  No  increase  in  imple¬ 
mentation  complexity  over  the  traditional  rank  tests  is  incurred  by  the  maximin  robust 
test,  as  it  is  required  only  to  store  a  different  vector  in  the  look-up  table. 

Section  6.6  extends  the  previous  tests  to  handle  an  unknown  signal  frequency,  a 
common  problem  which  is  always  associated  with  detection  of  coherent  signals,  originat¬ 
ing  from  targets  of  unknown  velocity.  For  the  one-  and  two-sample  ranking,  we  propose 
a  structure  similar  to  that  of  Chapter  5.3  -  first  n  Doppler  channels  are  generated  by 
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the  usual  “butterfly”  transformation,  and  then  a  robust  rank  test  is  performed  in  each  of 
the  Doppler  channels.  It  is  shown  that  the  asymptotic  SNR  loss,  for  signals  whose  fre¬ 
quency  is  not  matched  to  either  of  the  discrete  down-conversion  frequencies,  is  identical 
with  that  of  the  DFT  detector.  For  the  single-sweep  structure,  an  even  more  promising 
result  is  found.  It  is  possible  to  achieve  roughly  the  same  asymptotic  performance  of  the 
one-  and  two-sample  tests,  when  the  order  of  frequency  conversion  and  ranking  is  inter¬ 
changed!  First,  the  test  cell  observations  are  ranked  with  respect  to  their  adjacent  noise 
reference  observations,  and  then  an  FFT  is  performed  on  the  complex  vector  of  the 
scores  of  the  ranks.  Thus,  roughly,  the  complexity  is  that  of  the  FFT  algorithm, 
0{n\ozn),  compared  to  0{n-\ogn)  of  the  one-sample  and  SSQME  Doppler  detectors. 
The  major  disadvantage  is  that  fairly  large  M  is  required  for  small  losses,  thus  the 
single-sweep  rank  Doppler  detector  is  less  appropriate  than  the  one-sample,  in  a  nonho- 
mogeneous  environment. 

Finally,  Section  6.7  presents  some  of  the  important  results  of  an  extensive  Monte- 
Carlo  study.  While  convergence  to  the  asymptotic  theory  is  slower,  in  comparison  to  the 
SSQME  tests,  it  has  been  found  that,  roughly  for  n  >  32,  most  of  the  conclusions  of 
previous  sections  are  valid.  In  general,  for  small  number  of  reference  noise  vectors  M , 
the  robust  rank  detectors  achieve  a  better  than  the  robust  SSQME  tests. 

As  a  consequence  of  the  theoretical  and  simulation  results,  our  tentative  conclusion 
is  that  the  proposed  detectors  emerge  as  more  than  adequate  for  successfully  treating 
most  of  the  uncertainties  that  are  encountered  in  the  real-world  radar/sonar  detection 
environment.  Furthermore,  the  current  developments  in  VLSI  and  VHSIC  technologies 
indicate  that  implementation  in  typical  systems  can  be  considered  practical. 
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6.2  Definitions  and  Preliminaries 

6.2.1  -  The  Available  Observations  and  Classification  of  Rank  Tests 

Denote  by  {x^^j  the  samples  of  the  received  waveform  which  contain  the  infor¬ 
mation  about  the  useful  signal  if  it  is  present,  and  by  {xj-y  j  —  1,...,M,  the  sam¬ 

ples  from  a  noise  reference  channel.  Later,  we  shall  focus  on  narrow-band  processes, 
where  x  represents  both  the  in-phase  and  quadrature  components. 

As  a  typical  example  of  radar  and  active  sonar  systems,  the  transmitted  waveform 
is  an  n-long  high  frequency  pulse  train,  where  the  pulse  repetition  interval  (PRI)  is  T, 
and  nT  is  the  total  dwell  time  of  the  antenna  beam  at  a  given  spatial  direction.  The 
received  waveform  is  passed  through  a  filter  which  is  matched  to  a  single  pulse  in  the 
train,  and  the  effective  pulse  width  at  its  output  is  r.  The  {x^j}  are  obtained  by  sam¬ 
pling  the  matched  filter  output  every  r  seconds  and,  if  the  possible  target  location  is  at  a 
delay  of  from  the  receiver,  we  have 


^io  =  +  {i  -  l)r) 


x{to  -  (M/2  -  j  +  l)r+  [i  -  l)r) 
+  U  -  M I2)t  +  (2  -  l)r) 


(6.2.1) 

j  =  1,....  M/2 
j  =  M/2  +  1,...,  M 


In  this  way,  the  M  length  window  in  the  range  coordinate,  which  is  spanned  by  the 
resolution  cells  adjacent  to  the  possible  target  location,  supplies  the  noise  reference  chan¬ 
nel.  The  detection  process  is  basically  an  L  =  T/r  hypothesis  problem,  where  the 
hypothesis  (presence  of  a  target  at  t^  =  kr)  is  checked  sequentially  by  sliding  the  above 
window  in  the  range  coordinate.^ 

In  this  chapter,  we  shall  always  assume  that  the  observables  are  i.i.d.  across  the  i 
index  (sampling  time)  for  a  given  j  index  (“range  time”).  On  the  other  hand,  the 
assumption  of  identical  distribution  across  the  j  index  is  less  valid  in  some  situations, 

^  Sampling  in  other  coordinates  (frequency,  spatial  direction)  is  also  possible,  cf.  [18],  and  will  lead 
to  identical  results  as  long  as  the  assumptions  about  the  distribution  of  the  reference  samples  are 
identical. 
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though  it  might  appear  strange  as  x;y(/  ^  j)  is  more  separated  in  time  from  i.y  than 
^ik{^  7^  ;)•  This  is  so  because  the  dominant  noise  sources  in  radar  (sonar)  systems  are 
due  to  clutter  (reverberation)  reflections  of  the  transmitted  waveform  from  the  environ- 
ment  (ground,  sea,  precipitation  clouds,  etc.).  For  a  given  j ,  they  originate  from  the 
same  spatial  cell,  and  stationarity  can  often  be  assumed  as  the  typical  dwell  time  (0(10 
milliseconds))  is  too  short  for  substantial  change  of  the  reflection  characteristics  of  that 
cell.  On  the  other  hand,  samples  with  different  ;  originate  from  different  spatial  cells, 
and  the  variation  of  the  reflection  characteristics  is  often  large  in  nonhomogeneous 
environments.  The  choice  of  the  reference  window  length  M  is  an  important  design 
parameter.  It  reflects  the  designer’s  compromise  between  smaller  losses  at  an  homogene¬ 
ous  environment  with  large  M ,  and  the  losses  due  to  violation  of  the  inherent  assump¬ 
tion  that  the  reference  window  properly  represent  the  noise  in  the  detection  cell,  c.f.  [20|. 

Various  rank  test  structures  are  possible,  corresponding  to  the  subset  of  {x^j] 
which  is  chosen  to  be  ranked,  see  Figure  6.1. 

Figure  6.1 
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a)  Rank  Tests  for  Two-Sample  Situations 

Sticking  to  the  terminology  of  the  statistical  literature,  “two-sample”  tests  are 
based  on  ranking  of  the  entire  data  set.  The  observables  {x^j  }  are  pooled  in  a  vector  z 
of  length  N  =  n  (M  -|-  1),  such  that  the  suspected  signal  samples  are  in  the  first  group, 
i.e.,  2iS;„+i=  k  =  0,1, ...M,  z  is  then  ordered  and  the  vector  of  ranks  r  is  generated. 
Let  be  the  sample  in  the  ordered  vector,  and  r,-  the  rank  of  ,  i.e., 

,  i  =l,2,...n{M+l)  ^  N  (6.2.2) 

Naive  computation  of  r,-  can  be  done  with  0  {N^)  operations  according  to 

N 

=  S  U{zi  -  Zj)  ,  i  =l,...,N  (6.2.3) 

y=i 

where  (/(•)  is  the  unity  step  function.  In  practice,  random  sorting  algorithms  like  the 
QUICKSORT  [39]  are  preferred,  as  they  require  0[N\ogN)  operations.  Throughout,  we 
assume  that  the  p.d.f.  of  is  continuous;  thus,  the  r,-  are  well  defined  and  unique. 
The  statistic  =  r,-(z)  will  be  called  the  rank  of  ,  and  Ryy  will  denote  the  vector  of 
ranks  {Rpfi,  •  ■  ■  ,  Raw)-  simple  linear  rank  statistic  for  the  two-sample  situation  is 
defined  as 

ryv(Rjv)=  Yi  (6.2.4) 

i=l 

where  c  and  ajv  are  some  N  dimensional  vectors.  Notice  that  once  the  data  has  been 
ranked,  computation  of  Tyv  is  simpler  than  that  required  for  the  LMP  statistics  of  the 
form  ^  /  (x,- ),  as  it  is  only  necessary  to  store  in  a  look  up  table  the  N  values  of  . 

Under  the  alternative,  when  the  SNR  is  sufficiently  large,  the  ranks  of  the  signal 
samples  {z,o}  will  tend  to  be  the  largest,  thus  Tn  will  assume  larger  values  than  it  has 
under  the  null  hypothesis.  We  shall  see  that  c  and  a^  can  be  chosen  to  optimize  the 
detection  process.  In  general,  for  a  constrained  signal  structure,  two-sample  rank  tests 
will  result  in  the  best  performance,  but  are  also  the  most  demanding  in  terms  of 
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implementation  complexity.  Furthermore,  when  the  inherent  assumption  of  identical 
distribution  of  the  reference  samples  and  the  signal  samples  is  not  valid,  the  test  will  not 
be  CFAR.  Also,  large  detectability  losses  can  occur  if,  for  instance,  the  noise  power  in 
the  reference  samples  is  higher  than  in  the  signal  samples. 

Rank  tests  of  this  type  which  are  appropriate  for  radar  applications  have  been  stu¬ 
died  by  Hansen  and  Olsen  [53],  Zeoli  and  Fong  [54],  and  by  Al-Hussaini,  Badran  and 
Turner  [55];  see  also  [21]  and  [56]  for  extensive  bibliographies.  These  works  were  limited 
for  special  non-optimal  nonparametric  tests  (basically,  modifications  of  the  Wilcoxon, 
Mann-Whitney  and  Savage  statistics),  and  for  noncoherent  radar  processing.  The  reason 
for  that  is,  probably,  that  with  coherent  signals,  a  “bank  of  Doppler  tests,”  as  was  dis¬ 
cussed  in  Section  5.3,  is  necessary  when  the  signal  frequency  is  unknown.  Therefore,  the 
complexity  of  coherent  rank  tests  increases  by  a  factor  of  n  ,  when  compared  to  non¬ 
coherent  processing.  However,  as  a  widely  applicable  rule,  the  efficacy  of  coherent  pro¬ 
cessing  is  proportional  to  na^  (where  is  the  input  SNR),  whereas  it  is  proportional  to 
na^  with  noncoherent  processing,  hence  substantially  inferior  for  small  SNR  (see  Appen¬ 
dix  H).  Also,  the  above  mentioned  works  have  been  mostly  concerned  with  the  perfor¬ 
mance  for  Gaussian  derived  noises,  for  which  the  studied  test  statistics  can  be  shown  to 
be  non-optimal. 

b)  Rank  Tests  for  the  One-Sample  Situation 

In  this  case,  only  the  “signal”  observables  are  available,  In  radar-sonar 

systems,  this  restriction  can  be  enforced  when  the  environment  is  known  to  be  non¬ 
stationary,  and  no  reliable  assumptions  concerning  the  statistical  properties  of  the  sam¬ 
ples  across  the  reference  window  can  be  made.  Another  useful  application  of  one-sample 
tests  is  in  a  multiple  target  situation,  where  a  presence  of  another  large  signal  (e.g.,  the 
result  of  airplanes  flying  in  formation),  can  cause  havoc  when  a  two-sample  test  is  used, 
cf.  [20].  Traditionally,  one-sample  nonparametric  tests  have  been  based  on  the  rank  vec¬ 
tor  R,^  of  the  magnitude  of  the  observations:  Rj'  =  rank  of  |  x^q  j  in 
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(  I  a;  10  I  1  ar^o  I  ),  and  on  the  vector  of  signs 

S„)  =  2  sign{xio)a„{R„t)  (6.2.5) 

i  =1 

To  achieve  the  DF-CFAR  property  under  the  hypothesis,  it  must  be  assumed  that 
the  {xjo}  have  symmetric  distribution.  Application  for  narrowband  coherent  signals  was 
studied  by  Hansen  [40]  with  the  Wilcoxon  scores  a„  )  =  /?„,• .  In  this  chapter,  we 
also  study  statistics  which  do  not  depend  on  the  sign  vector 

Tn  (Rn)  =  E  Ci  {R„i )  (6.2.6) 

1=1 

Here,  no  assumption  of  symmetry  is  required  to  obtain  CFAR;  hence  these  tests  are 
inherently  more  robust  than  the  rank-signed  tests,  (6.2.5).  However,  in  order  to  achieve 
satisfactory  detection  performance,  it  is  necessary  to  apply  a  particular  modulation  of 
the  transmitted  signal  set  {s,-  },  in  a  rather  nonrestrictive  way.  Perhaps  surprisingly,  it 
turns  out  that  it  is  possible  to  achieve  better  performance  than  that  of  the  two-sample 
tests,  and  actually  the  performance  of  the  LMP  parametric  detector  is  reached  asymp¬ 
totically.  At  the  same  time,  the  ranking  complexity  is  substantially  reduced.  With 
QUICKSORT,  the  reduction  is  by  a  factor  of  (M+l)[l+log2(M-bl)/log2(n )]  ^=^9  for 
n=32,  M=5. 

c)  “Single-Sweep”  Rank  Tests 

Here,  for  each  information-gathering  “sweep”  i,  the  observables  are 

ranked  separately,  i.e.,  R(m+i){  =  rank  of  in  see  Figure  6.1.  The  test 

statistic  is 

n 

+  ~  S  (A/+l)t  )  (6.2,7) 

i  =1 

This  test  can  be  considered  as  a  special  case  of  a  two-sample  problem  for  random¬ 
ized  blocks  [50],  or,  in  accordance  with  the  terminology  of  Feustel  and  Davisson  [57-58] 
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as  a  “mixed”  test.  Application  to  nonparametric  radar  detection,  with  noncoherent  pro¬ 
cessing  and  Wilcoxon,  Savage  and  rank-squared  scores,  have  been  studied  in  [52-55]. 
These  tests  minimize  both  the  ranking  complexity  and  the  storage  requirements.  From 
sweep  to  sweep,  it  is  necessary  to  keep  only  a  single  number,  a  {R^ ),  compared  to  storage 
of  n  and  n{M+l)  observables  with  the  one-  and  two-samples  tests,  respectively,  before 
the  test  statistic  can  be  computed.  ^  Further,  the  ranking  complexity  is  linear  in  M,  since 
only  a  single  rank  is  needed  for  each  sweep.  For  noncoherent  processing  in  Gaussian 
derived  noise,  the  detectability  losses  compared  to  the  corresponding  two-sample  tests 
are  typically  negligible  if  M  >  8,  see  [53-55].  This  is  probably  the  reason  for  the  wide 
applicability  of  these  tests  in  modern  noncoherent  radar  processing. 

For  a  non-stationary  environment,  the  usefulness  of  these  tests  is  restricted,  as  M 
must  be  kept  small,  due  to  the  same  reasons  that  were  discussed  with  regard  to  the 
two-sample  tests. 

We  will  show  later  that  for  coherent  narrowband  signals,  the  relative  loss  between 
the  single-sweep  and  the  optimal  one-sample  test  is  considerable,  when  M  is  restricted  to 
be  small  in  order  to  preserve  the  stationarity  assumption. 

6 ,2. 2  -  Some  Previous  Results  on  Rank  Statistics 

In  this  subsection,  we  quote  and  summarize  some  known  properties  of  rank  statis¬ 
tics  which  are  needed  in  the  sequel.  Except  for  the  asymptotic  distribution  results,  on 
which  most  of  the  results  of  subsequent  sections  are  based,  the  proof  of  the  quoted  facts 
is  pretty  straightforward.  For  a  complete  exposition,  see  Hajek  and  Sidak  [49]. 

i)  Hypothesis  Hq 

Under  Hq,  the  observations  {jj  are  assumed  to  be  i.i.d.  according  to  some 

The  total  storage  for  the  entire  detection  range  should  be  multiplied  by  the  number  of  range  cells 
L  =  T  /r,  see  the  discussion  following  Eq.  (6.2.1).  L  is  between  100-1000  for  typical  radar  sys¬ 
tems. 
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arbitrary  one-dimensional  p.d.f.  /  (•).  Let  be  the  rank  vector  of  X  and  r  some  point 
in  the  space  'P[N)  of  all  permutations  of  (l,2,...,A^).  Then 

Theorem  6.2.1 

Pr  {R^  =  r}  =  ^  ,  re  P(Ar)  (6.2.8) 

Pr  {Rm  =  =  ^  >  1  <  i  (6-2.9) 

Pr  ^  ;  Rn]  =  i}  —  1  i  ^  ^  (6.2.10) 

and  so  on  for  the  other  joint  distributions.  Notice  that  these  are  independent  of  the  par¬ 
ticular  /  (•),  thus 

N 

Corollary  6.2.1  Any  test  based  on  the  statistic  Tn  “  Y]  ^Ni^Ni)}  is  distribution 

1 

free  under  (DF-CFAR). 

Define 


a  =  {1/N)  E  ,  c  =  {l/N)  E  c,-  ,  =  (l/(iV  -  1))  E  («(0  -  af 

t=l  J=1 

Then 

Theorem  6.2.2  Under  Hq, 

ETj^  =Na*c‘  ,  VarT^  =  <^a  Zl  (6.2.11) 

2=1 

ii)  Hypothesis  (Symmetry) 

/  (a: )  is  an  arbitrary  one-dimensional  symmetric  p.d.f., 
f  [x)  =  f  (-x),  X  G  (-00,  oo),  and  the  samples  {xi}/Yi  are  i.i.d.  Let  Ryy  be  the  rank 
vector  of  the  absolute  values  of  the  observations,  S  be  the  vector  of  the  signs  of  the 
observations,  and  D  be  a  vector  such  that  =  ±  1.  Then, 

Theorem  6.2.3  Under  Hi^  for  arbitrary  symmetric  /  (•),  the  random  vectors  and  S 
are  mutually  independent  and 


Pr  {S  =  D}  =  (i)« 


P,{R^  =  r}  =  - 


WT  ■  ■•ePW 


(6.2.12) 
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N 

Corollary  6.2.2  Any  test  based  on  the  statistic  A  sign{Xi),  is  DF- 

1 

CFAR  under  H i. 

Theorem  6.2.4  Under  Hi, 


ETrr  =  0  ,  Var  T/ ==  f]  a\i) 

t==i 

iii)  The  (^-Functions  and  Scores 

Let  F{x)  be  the  distribution  of  /  ,  and  F~\u  )  its  inverse.  Define 


(6.2.13) 


0  <  u  <  1 


(6.2.14) 


Lemma  6.2.5  If  /  (•)  is  strongly  unimodal  (i.e.,  -log/  [x]  is  convex  on  the  support  of 
F  (x  )),  (f>{u  ,  /  )  is  non-decreasing. 


Lemma  6.2.6  If  Fisher’s  information  /(/  )  <  oo, 


1 


/(/)  =  /  <t>\u,f)du 
0 


(6.2.15) 


Define  the  scores  for  a  density  /  (•)  by 

J  =  "(/-/)/ f  di  {6.2,16) 

j  -OO 

where  are  the  order  statistics  of  x. 

In  a  similar  manner  for  Hi^ 

4>+{u  ,f  )  A  f  )  (6.2.17) 

-/(■'./)  A  y[|';  I, (<’■2.18) 


Theorem  6.2.7  If  J  |  /  '(z )  |  da;  <  oo,  then 


N 


E  a^{i,f)  =  0  ,  i  E  «iv(u/)  <  /(/) 

t=l 


(6.2.19) 
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and  similarly  for  a^v”.  As  N  co,  the  left-hand  side  of  the  second  equation  converges 
to  I{f  ). 

We  shall  refer  to  the  )  as  the  optimal  scores  (for  a  known  f  ).  Let  J{u)  be 

some  arbitrary  function,  bounded  and  square  integrable  on  [0,l].  Approximate  scores  are 
generated  according  to 

«iv(0  =  /{N  +  1))  (6.2.20) 

or 

ijN 

api{i)  =  N  f  J{u)du  (6.2.21) 

The  ajv(')  are  said  to  converge  quadratically  to  J{u),  denoted  as  a^(')  /  if 

1 

lim  f  [a^(l  +  [uiV])  - /(u)]2du  =  0  (6.2.22) 

TV  — »•  oo  Q 

where  [a:]  is  the  largest  integer  in  [0,a:]. 

Theorem  6.2.8  If  a^[i)  satisfy  either  (6.2.20)  or  (6.2.21),  Also, 

)•  The  implication  of  this  theorem  is  that,  asymptotically,  ayv(2,/  ) 
can  be  replaced  by  the  approximate  scores  ayv(0»  J{u)  =  <^(u  ,/  ),  see  [49,  pp. 

155-166]. 

iv)  Asymptotic  Normality  of  Linear  Rank  Statistics 

The  asymptotic  distribution  of  simple  linear  rank  statistics  is  by  no  means  a 
straightforward  consequence  of  the  central  limit  theorem  -  notice  that  the  components  of 
the  rank  vector  are  neither  independent  nor  identically  distributed.  The  question  of 
asymptotic  normality  of  rank  statistics  has  been  treated  by  many  theoretical  statisti¬ 
cians,  cf.  Chernoff  and  Savage  [59]  and  the  other  references  cited  there.  The  following 
deep  theorems  by  Hajek  [49,61],  who  drew  on  an  earlier  work  of  LeCam  [60],  are  the 
most  general  for  our  purposes. 
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Consider  a  shift  alternative  to 


N 


f  i(x)  =  n  /  (a:,-  -  di)  ,  I{f  )  <  OO, 


i=l 


(6.2.23) 


where  the  {d^ }  satisfy  Noether^s  conditions 


N 


lim  5]  {di  -  dy  =  <  OO 


iV  OO 


lim 


E  [di  -  df 

i=l 


iv  OO  max  id:  -  d) 

l<i<N 


=  OO 


(6.2.24) 


(6.2.25) 


Notice  that  these  conditions  constitute  a  special  definition  of  a  “small  signal,”  which  is 
an  example  of  the  “contiguity”  (by  LeCam’s  notion)  of  /  ^(x)  to  /  o(x).  In  particular, 
for  either  a  zero  mean  signal  =  (-1)*  d,  or  for  a  constant  signal  in  the  “signal”  sam¬ 
ples  di  =  d{l-U{i-n-l)),  (6.2.25)  is  automatically  satisfied.  The  condition  (6.2.24) 
reduces  to  Nd^  ►  b  or  n{l—n/N)d‘^ — >-6.  Thus,  the  regularity  condition  for  non¬ 
singular  detection  is  sufficient,  and  d  — >  0  alone  is  not  necessary. 

We  are  interested  in  the  distribution  of 


N 

Tn  (^iV  t  ^)  Xj  Ni  )  ~  ■  C 

i=l 


(6.2.26) 


where  a;v(-)  J . 


Theorem  6.2.9  Under  the  alternative  (6.2.23),  when  the  conditions  (6.2.24)  and  (6.2.25) 
hold,  the  rank  statistic  (6.2.26),  with 


N 

EiCi-c)^ 

lim  - -  =  OO 

N  -*  CO  max  (c;  -  c  Y 


VS  asymptotically  W(/r^c  >  where 


(6.2.27) 


A 


[E  (c.  -  c  ){di  -  d)]  J  [J{u  )<p{u  ,f  )du 
1  0 


(6.2.28) 
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N  ^  1 

^0  ^  E  i^i  -  I  [^(^)  -  ^]^du  ,  7  J  /  J(u)du  .  (6.2.29) 

10  0 

and  (i>{u  jf  )  is  given  by  (6.2.14).  The  same  is  true  under  Hq  where  =0. 

n 

For  H ij  with  alternatives  11  /  (x,-  -  d )  (recall  that  now  /  is  symmetric),  and 

i  ==:! 

nd^  — ^  6^  <  oo,  similar  result  hold  for  the  distribution  of  rank  signed  statistics. 


Theorem  6.2.10  The  statistic  (6.2.5)  with  a„(-)*^/,  is  asymptotically  A^(/i^,cr^), 
where 


^  dn  J  J(ti )  ^'^(u  ,f  )du 
0 

~  ^  ^  f  )du 

1  0 

where  jf  )  is  given  by  (6.2.17). 


(6.2.30) 

(6.2.31) 


6.3  Asymptotically  Optimal  Quadrature  Rank  Tests  for  Narrowband  Fading 
Signals  in  Known  Noise 

6.3.1  -  Two-Sample,  One-Sample  and  Single-Sweep  Optimal  Tests 

Asymptotically  optimal  rank  tests  for  detection  of  additive  deterministic  signals 
have  been  studied  by  Hoeffding  [62],  Capon  [63],  and  Hajek  [61].  The  results  of  [61] 
actually  show  that  the  rank  information  is  asymptotically  sufficient,  i.e.,  the  optimal 
rank  tests  achieve  the  best  possible  performance.  As  in  Chapter  4,  for  narrowband  fad¬ 
ing  signals  with  random  parameters,  we  optimize  the  performance  within  a  given  family 
of  tests.  In  view  of  our  previous  results  and  Appendix  E,  it  is  not  surprising  that  the 
optimal  ensuing  test  is  composed  of  the  test  statistics  that  are  also  optimal  for  deter¬ 
ministic  signals,  and  has  the  structure  of  a  quadrature  test. 
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The  joint  p.d.f.  under  the  alternative,  for  the  observations  structure  of  Section 
(6. 2.1. a),  is 

n  N 

f  (x,y)  =  n  /  {xi-aSiA  cose)  f  {yi~aSi  A  sine)  n  /  (x;)/  (y.  )  (6.3.1) 

where  ^  is  a  positive  r.v.  with  arbitrary  distribution,  e  is  C/[0,27r],  {sj  }  is  a  known  sig¬ 
nal  sequence  with 

s  ^  1  E  S;  <  oo  ,  A  1  ^  2  _  1  (g  3  2) 

«  i=i  ^  .=1 

The  parameter  a  corresponds  to  the  unknown  SNR  and  N—n{M+l).  f  (•)  is  a  p.d.f. 
of  known  shape,  i.e.,  any  shift  and  scale  transformation  f  {x)  (1/^)  /  ((^  -  m)/^)  is 

allowed,  as  the  rank  vector  is  invariant  for  it. 

The  following  tests  will  be  CFAR  for  anj/  f  (•)  [see  corollaries  6.2.1  and  6.2.2]  and 
will  optimize  the  detection  performance  for  the  particular  /  (•). 

Define  a  class  of  tests 

Qn{^N7  ^n)  ~  Qn{^n{^n'7  ^A^))  (6.3.3) 

where  is  the  vector  of  ranks  for  the  pooled  observations  {x^  is  likewise  for 

{y.- <^Ni-)^J{u),  J{u)  is  quadratic  integrable  on  [0,1],  c  is  some  vector  that 
satisfies  (6.2.27),  and  T^[-,  c,ajv)  is  a  linear  rank  statistic  given  by  Eq.  (6.2.26). 


Proposition  6.3.1  Under  the  alternative  (6.3.1)  for  Hq,  when  only  the  shape  of  /  is 

known,  if  lim  a^n  =  b  <  oo,  the  asymptotically  optimum  test  in  the  class  (6.3.3)  is, 
n  00 

for  any  distribution  of  A  , 


> 


Yj  {^Ni  J  /  )]  +  [  S  A  7  /  )]  <  (^) 


(6.3.4) 


where  a^{i  J  )  —>-J(u)  —  <i>{u  ,f  )  according  to  either  (6.2.16),  (6.2.20)  or  (6.2.21)  and 

1 

-)  f  J\u  )du 


M+1 


t„  (a)  =  -21ogQ;  7z  (1  - 


(6.3.5) 
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In  particular,  the  optimum  detection  probability  for  a  Rayleigh  signal  with  A  ^ 

,s)  =  ■*'  >®)l 

where  the  “effective  integrated  SNR”  is  defined  as® 


1  is 
(6.3.6) 


SNR  (f  ,s)  =  fl  -  , 

^  2  ^  (M+1) 


ni) 


(6.3.7) 


Proof  We  first  need  to  show  asymptotic  normality  of  the  in-phase  and  quadrature  rank 
statistics.  Condition  (6.2.24),  specialized  to  the  model  (6.3.1)  with  ^=1,  reduces  to 


Also,  Eq.(6.2.25)  becomes 


■)  lim  na^  <  oo 

n  —>00 


lim  - rt^l-sy{M+l)) - _  ^ 

n-^oo  max{(s,--s  /{M+l)f,  s‘^/{M+lf} 
i 

Thus,  the  conditions  of  Theorem  6.2.9  are  satisfied,  c,a^)  and  c,a^) 

are,  conditioned  on  A  and  9,  jointly  Gaussian  and  independent.  Substituting  the  qua¬ 
drature  shift  parameters  of  (6.3.1)  into  (6.2.28)  yields 


E  {rjv(Rjv;  c,a^)  \A  ,9}  =  aA  cos9  (c,-  -  c  )(d,-  -  d)  J  J[u  )4>{u  J  )du(6.3.8) 

t  =  l  0 

N  _  1 

E  {R^]  c,a.i^)  \  A  ,9}  =  aA  sm9  ^  (cj  -  ~)(di  -  d)  f  J(u  ,f  )du  (6.3.9) 

t=l  0 

where  d,- =s,- (l-{7(z -n -1)),  d=(n/N)'s,  and  }— *•  l^o{  }  a-re  identical  for  both 

channels  and  are  given  by  (6.2.29).  As  in  the  proof  of  Proposition  4.1  (see  Chapter  4.2, 
Eq.  (4.16)  and  the  following),  the  UMP  test  statistic  in  the  class  (6.3.3),  for  a  given  J (u  ) 
and  c,  is  given  by  ^^^(Rjy;  c,  a^)  4-  Ti^(R^]  c,  a^).  This  is  true  for  any  distribution 
of  A  since  the  likelihood  ratio  is  a  monotone  function  of  this  statistic  for  any  positive 
A  .  It  remains  to  optimize  c  and  /(•).  As  in  Chapter  4.2  (see  also  Appendix  E),  the 


®  Notice  that  SNR{f  ,a)/{na’^/2)  coincides  with  the  usual  definition  of  the  efficacy  for 
asymptotically  Gaussian  test  statistics.  See  also  Appendix  E. 
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asymptotic  detection  probability  is 


EaQ 


c,a^)  U  ,0}  4-  c,a^)  |  A  ,6} 


-11/2 


(6.3.10) 


,  \/-21oga 


where  (5(-,-)  is  Marcum’s  Q -function  (notice  that  d  cancels).  Since  it  is  monotonic 
increasing  in  the  first  argument,  it  suffices  to  maximize  the  efficacy 


(6.3.11) 


N 


e{cA,J{M)=G{cA)G{J[-),f  {■))-- 


[//(u  )</.(u  ,/  )du\‘‘ 


1=1 


N 


-t\2 


^E(ci-c) 


J [J{u  )-J]'^du 


For  future  purposes,  G{J ,f  )  is  also  given  by 

OO  00 

1/  J(F{x))f'{,)dzf  1/  J'(F{x))f\x)dxf 


G(/(-),/  (■))  =  - 

f  [J{u  )  -  J\^du 
0 


/  )  “  J^du 


(6.3,12) 


where  the  second  equality  is  valid  for  bounded  and  differentiable  /(*). 

From  the  Cauchy-Schwartz  inequality,  we  must  have  for  a  maximum 


-  c  =  di  -  d  = 


(M+1) 


+  5^(1  -  U{i  -  n  -  1)) 


(6.3.13) 


Since  J  <f){u  J  )  =  J  f  '{x)dx  =  0,  we  can  replace  in  the  numerator  of  (6.3.11)  J{u) 

0  -OO 

by  /(u)  -  and  from  the  Cauchy-Schwartz  inequality  we  must  have  for  a  maximum 
J{u)  =  (f>{u  ,f  ).  Hence  the  optimal  scores  are  f  (f>{u  ,  f  ).  Substituting 

(6.3.13)  into  (6.2.26), 


J  ^ 


Tn  =  )ajv(^iVi ,/  )  =  E  «iv(^M  J  )  -  ,  E  o^Ni^Ni  J  )  (6.3.14) 

1  t=l 


and  the  last  term  vanishes  by  virtue  of  (6.2.19).  Also, 
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Mn 


{M+lf 


n  (1 


M+1 


■)  (6.3.15) 


Under  Hq,  is  one  sided  exponentially  distributed,  and  (6.3.5)  is  obtained  from 
(6.2.29)  and  (6.3.15).  For  a  Rayleigh  signal,  is  again  one-sided  exponential,  as  in 
Chapter  4.3.  Combining  (6.2.15),  (6.3.11)  and  (6.3.15),  yields  (6.3.6)  and  (6.3.7),  Q.E.D. 


Remarks  1)  Proposition  6.3.1  remains  valid  when  the  definition  of  the  class  (6.3.3) 
allows  different  and  c  vectors  for  the  x  and  y  channels.  2)  Note  that  in  the  optimal 
test,  the  known  signal  sequence  is  correlated  with  the  vector  of  scores,  and  not  with  the 
raw  observation  vector.  This  is  also  true  for  all  the  other  tests  that  follow. 


Corollary  6.3.2  For  a  constant  signal,  s,-  =  1,  SNR  [J  ,s) 


na 


M 


I{f)-  As 


2  M+1 

M  — >  oo,  it  approaches  the  performance  of  the  parametric  locally  most  powerful  (LMP) 
quadrature  test 


Qlmp  = 


f'M 

f  (a;.- ) 


2 

+ 


M 


E 


Si 


fiVi)? 

f  iVi) 


(6.3.16) 


which  necessitates  complete  knowledge  of  /  (■)  to  obtain  the  CFAR  property.  While  the 
rate  of  convergence  with  increasing  width  of  the  reference  window  is  low,  the  losses  for 
small  M  are  reasonably  low,  e.g.  3dB  and  IdB  for  M  =  1  and  4,  respectively. 


Proof  Computation  of  the  asymptotic  detection  probability  for  the  test  based  on  the 
statistic  (6.3.16)  is  similar  to  that  in  the  proof  of  Prop.  6.3.1.  Substituting  s'=l  in 
(6.3.7)  shows  they  are  identical  when  M—*oo. 

By  the  way  we  have  defined  the  parameters  in  (6.3.1),  aA  represents  the  unknown 
reflection  characteristics  of  the  target  signals,  whereas  the  {s,- }  are  at  the  control  of  the 
system  designer.  In  particular,  it  is  possible  to  obtain  arbitrary  {s,-  }  by  introducing 
phase  modulations  in  the  transmitted  high-frequency  pulse-train.  In  this  way,  none  of 
the  transmitter  power,  which  determine  the  achievable  detectability,  is  lost.  From 
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(6.3.7)  or  (6.3.15),  it  is  clear  that  even  better  performance  than  that  of  Corollary  6.3.1  is 
possible. 

Corollary  6.3.3  (The  optimal  signal.)  With  zero  mean  signal,  5=0,  the  best  perfor¬ 
mance  can  be  achieved  even  with  M  =  0,  i.e.  without  any  reference  window!  In  partic¬ 
ular,  for  even  n  ,  with  (n  /2)  phase  shifts  of  tt  on  the  transmitted  pulse  train,  at  arbi¬ 
trary  order,  one  obtains  the  the  simple  statistic 

Tn  =  E  E  <^n[RniJ)  (6.3.17) 

<t>,  =  0  <!>,  =  T! 

Notice  that  CFAR  is  obtained  without  any  assumption  of  symmetry  of  /  (•),  and 
no  use  is  made  of  the  sign  of  the  observations.  Therefore,  a  quadrature  test  based  on 
(6.3.17)  will  be  the  best  for  multiple  targets  and  non-stationary  environment.  The 
statistic  (6.3.17)  is  also  very  reasonable  for  finite  n  .  For  large  SNR,  at  both  quadrature 
channels,  the  ,  4>i  =  0}  will  tend  to  be  among  the  lower  or  upper  half  of  the  rank 
vector,  depending  on  the  arbitrary  target  phase  9.  As  a„{i)  is  increasing  with  i,  see 
Lemma  6.2.5,  the  absolute  value  of  grows  with  SNR,  and  achieves  its  maximum 
value  when  SNR  -+  oo  and  the  ranking  is  perfect  (i.e.,  R„i  >  n  /2  or  R„i  <  n  /2,  V 

h  =  0). 

Proposition  6.3.4  Under  the  alternative  (6.3.1)  for  Hi  and  a  one-sample  situation,  if 
lim  a^n  <  oo,  the  asymptotically  optimal  test  is,  for  any  distribution  of  A  , 

(  E  sign  {xi  )a„  [Rfi^J  )]^  +  [  E  iVi  )«n  {Rni^J  )]^  in  («)  (6.3.18) 

i=l  1=1  ^0 

where  {i  J  )  J[u  )  =  (f>^{u  J  )  and 

1 

tn  (^)  =  -21oga  n J  J\u  )du 
0 

For  a  Rayleigh  signal,  SNR  (/  )  =  -^^^/(/  ). 

2 


(6.3.19) 
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Proof  Parallels  the  proof  of  Proposition  6.3.1,  using  Theorem  6.2.10. 

Remarks  1)  The  performance  is  identical  with  the  test  of  corollary  6.3.3.  However,  this 
test  is  more  complicated  to  implement  as  the  sign  vectors  are  also  needed,  and  is  less 
robust  as  it  is  CFAR  only  for  symmetric  /  (■).  2)  With  0^(2 )  ==  i ,  this  is  the  Wilcoxon 
narrowband  detector,  which  has  been  studied  by  Helstrom,  Carlyle  and  Hansen  [40]. 
However,  no  optimum  properties  of  this  test  were  discussed  by  these  researchers. 

Similarly,  for  single-sweep  ranking,  Chapter  6.2.1c, 

Proposition  6.3.5  Under  the  alternative  (6.3.1)  for  H q,  the  single-sweep  asymptotically 
optimum  test  is,  if  lim  na'^  <C  00,  for  any  distribution  of  A  , 

[  E  Si  «M+i(^(M+i)i,/  )]^  +  [  E  Si  aM+i(^fAf+i)i,/  )]“  (a)  (6.3.20) 

where  aA/+i(c/  )  given  by  Eq.  (F.8)  of  Appendix  F,  and 

1  Af  +1 

(a)  =  -21oga  (/ ,/  )  (6.3.21) 

For  a  Rayleigh  signal, 

'H/7  2  1  A/  +1  2 

SNR  =  ^  [‘.I  (6.3.22) 

Thus,  with  large  M ,  single-sweep  rank  tests  do  not  incur  any  loss  and  are  appropriate 
for  homogeneous  environment. 

Proof  See  Appendices  E  and  F,  Unlike  the  previous  cases  of  one  and  two-sample  rank 
tests,  the  derivation  is  based  on  a  simpler  asymptotic  distribution  result,  as  the  channels’ 
statistics  in  (6.3.20)  are  the  sum  of  i.i.d.  variables. 

6.3.2  -  Some  Optimal  Score  Functions  (See  also  [49]). 

i)  For  Gaussian  /  ,  <t>{u  J  )  =  (jr^[u).  When  the  scores  are  computed  according  to 
(6.2.20),  i.e.,  the  various  rank  tests  are  of  the  Van  der 
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Waerden  (VDW)  type.  If  the  scores  are  computed  from  (6.2.16),  they  are  of  the 
Fisher-Yates-Terry-Hoeffding  (FYTH)  type.  Both  are  asymptotically  equivalent. 

ii)  For  logistic  p.d.f.,  /  =  (1  +  e'*  )-2,  ,/ )  =  2(u  -  1/2).  Computed  by 

(6.2.20),  the  a^(t)  are  the  Wilcoxon  scores  t  /(A^+1)  -  1/2. 

iii)  For  double  exponential  /  ,  )=sign(u -1/2).  The  corresponding  two-sample 

^  .  N 

test  IS  of  the  median  type,  T'=  ^  sign(i?jVi - — )•  For  one-sample  situation, 

i=i  2 

(see  6.2.17),  thus  and  a  narrowband  sign  test  is 

obtained. 

iv)  For  the  “least  favorable”  p.d.f.  from  an  e-mixture  family  with  a  Gaussian  nominal, 
Eq.  (4.13),  f'/f{F~\u))  is  constant  for  |F“^(u)|  <k,  and  in  this  interval 
F(a: )  =  (1  -  e)  <l>[x )  +  e/2;  thus  F~^[u )  can  be  solved  explicitly  and  it  is  obtained 
(see  also  [3,  pp.  99]) 


k 


>f  )=< 


W-€  ^ 


W  >  J  +  {1  -  e)  (l>{k) 
±+{l-e)cl>{-k)<u  <|-b(l-€)  cf>{k) 


-k  ,u<— -|-(l-e)  (f>{-k  ) 


(6.3.23) 


The  various  rank  tests  designed  with  this  score  function  possess  a  maximin  pro¬ 
perty  as  will  be  shown  in  Section  6.5.  We  will  refer  to  them  as  the  robustified 
VDW  type  (RVDW).  The  robustness  results  from  limiting  the  influence  of  the 
extreme  ranks.  As  e  — >■  1,  the  RVDW  tests  clearly  reduce  to  the  median  tests. 


6.3.3  -  Comparison  Between  Two-Sample  and  Single-Sweep  Tests 

In  general,  the  relative  performance  between  the  tests  that  are  based  on  different 
ranking  structures,  depend  on  the  underlying  noise  p.d.f.  and  on  the  score  function. 

With  the  optimal  score  function  for  a  known  /  ,  the  ARE  is  found  from  (6.3.7)  and 


(6.3.22)  as  the  ratio  of  efficacies, 
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^■^•£'25 ,55  = 


-25 


'55 


=  (1- 


5^ 


M+1 


■)  ni ) 


M+l 


M+1 


E  “M+l  (^/  ) 


-1 


(6.3.24) 


where  subscripts  SS  and  25  denote  single-sweep  and  two-sample,  respectively. 

Assuming  that  N  =  is  sufficiently  large  (but  finite)  such  that  asymptotic 

normality  has  been  reached,  an  approximation  for  the  relative  efficiency  (RE)  is  obtained 

N 

when  in  the  above  /(/ )  is  replaced  by  )  A  {l/N)  ,f  )■  Notice  that 

1 

(6.3.24)  with  s=0  gives  also  the  ARE  with  respect  to  the  previously  studied  one-sample 
tests. 


Table  6.1  displays  /^(/  )  with  VDW  scores.  Table  6.2  shows  the  RE  for  Gaussian 
f  ,  n  =20  and  different  window  sizes.  It  is  seen  that  in  a  non-stationary  environment, 
when  small  M  is  preferred,  the  loss  incurred  by  single-sweep  ranking  is  substantial. 

-0v(/  )  for  the  “least  favorable”  /  *(e  =  0.1)  with  the  RVDW  [k  =  1.14)  scores, 
generated  from  (6.3.23)  according  to  (6.2.20),  is  shown  in  Table  6.3.  As  the  rate  of  con¬ 
vergence  to  /(/  *)  =  .671  is  much  faster,  the  loss  from  single-sweep  ranking  is  smaller. 
For  n  —  20,  M  =  3,  RE gg  =1.31  only.  (Of  course,  the  absolute  efficiency  for  each 
of  the  tests  for  /  *  is  smaller  by  0.671  compared  to  the  Gaussian). 

With  the  Wilcoxon  scores,  the  ARE  of  single-sweep  ranking  with  respect  to  the  full 
ranking  schemes  takes  a  very  simple  form: 


Proposition  6.3.6  The  ARE  of  single-sweep  ranking  with  Wilcoxon  scores  is  indepen¬ 
dent  of  f  and  is  given  by 

■^^^2S,SS  ~  [1  ~  [1  +  2/A/]  (6.3.25) 

AREig  gg  =  1  2fM  (6.3.26) 

The  proof  is  given  in  Appendix  G,  where  it  is  shown  that  for  /(u )  =  u  *  -  1/2,  and 

a{i)=i/{N  4- 1)  -  1/2,  J  J {u)(f){u  J  )du  is  proportional  to  ^  a(0®A^(G/)  for  a-H 

0  i  =1 
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/  .  As  for  the  VDW  scores,  for  non-stationary  environment  when  large  M  cannot  be 
taken,  Eqs.(6.3.25-26)  show  that  single-sweep  ranking  incurs  considerable  loses;  neverthe¬ 
less,  the  ARE  is  clearly  bounded  by  3  (—+5  dB  loss). 


6.3.4  -  Comparison  with  the  SW-SSQME  Test 

From  Eq.  (5.6),  the  detection  probability  of  a  Rayleigh  signal  with  the  SW-SSQME 
CFAR  detector,  at  the  noise  p.d.f.  against  which  it  is  optimized,  is  given  by 


^sw{f  >o;) 


1  + 


a 


r\lM  _  j 


1 


-M 


SNRsiv  =  ^nf)  (6.3.27) 


The  detection  probability  for  the  two-sample  optimum  rank  test  (2SR)  is  given  by 

(6.3.6),  where  the  SNR  with  a  constant  signal  (the  worst  case)  is 
2 

SNR  2sr  ^  — 2 — ^  ^  limit  distributions  are  different  for  finite  M ,  a 

single-number  measure  of  performance  like  the  ARE  is  inappropriate.  In  Table  6.4,  we 

2 

compare  the  effective  integrated  SNR,  ISNR  ),  which  is  required  to  attain 

specified  a  and  jS,  equal  for  both  detectors. 

ISNRsw  =  (a-'/^  -  l)/(/r'/^  -  1)  -  1  (6.3.28) 

ISNR2SR  =  (loga/log/3-  1)  (6.3.29) 

Notice  that  the  performance  for  the  optimal  one-sample  rank  tests  is  given  by  the 
M  00  column. 

The  performance  of  the  optimal  rank  tests  is  seen  to  be  substantially  superior 
unless  M  is  large.  Therefore,  as  was  promised  to  be  shown  at  the  beginning  of  this 
chapter,  they  are  to  be  preferred  in  a  nonhomogeneous  environment  where  M  must  be 


restricted. 
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N 

In  if  ) 

N 

iN(f) 

4 

.386 

100 

.923 

6 

.497 

120 

.933 

8 

.570 

140 

.940 

10 

.622 

160 

.946 

12 

.661 

180 

.951 

14 

.691 

200 

.955 

16 

.716 

400 

.974 

32 

.822 

600 

.982 

64 

.892 

1200 

.990 

Table  6.1 


N 


^k(/ )  —  £  ir‘( 


(N+l) 


■)]^  .  Gaussian 


/ 


M 

1 

3 

5 

7 

9 

11 

15 

31 

63 

RE  Of;  <;(; 

1.34 

1.76 

1.56 

1.45 

1.38 

1.33 

1.26 

1.16 

1.09 

2.67 

2.35 

1.88 

1.66 

1.54 

1.45 

1.35 

1.20 

1.11 

Table  6.2  Relative  efficiencies  of  one-sample  and  two-sample  ranking 
compared  to  single-sweep  ranking.  VDW  scores,  5=l,n=20. 


N 

4 

6 

10 

16 

32 

64 

100 

200 

oo 

InH*) 

.508 

.582 

.608 

.632 

.652 

.661 

.665 

.668 

.671 

Table  6.3  /^(/  *)  for  the  least  favorable  /  *(€=0.1).  RVDW  scores  with  k=1.14. 
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M 

1 

2 

4 

8 

16 

32 

128 

CO 

ISNRsw  [dB) 

19.9 

13.2 

10.2 

8.8 

8.2 

7.8 

7.6 

7.5 

ISNR^^nidB) 

10.5 

9.3 

8.5 

8.0 

7.8 

7.6 

7.5 

7.5 

Table  6.4  Effective  integrated  SNR  for  SW-SSQME  and  2-sample  rank  tests, 
both  optimized  for  the  same  noise  p.d.f.  a)  =10“^,  F,;=0.5. 


M 

1 

2 

4 

8 

16 

32 

128 

oo 

ISNRsw  {dB) 

29.5 

22.2 

19.0 

17.6 

20.0 

16.6 

16.4 

16.3 

ISNRo<,p(dB] 

19.3 

18.1 

17.3 

16.8 

16.6 

16.4 

16.3 

16.3 

Table  6.46  )  P,;=0.9. 


M 

1 

9 

4 

8 

16 

32 

128 

OO 

ISNRsw  {dB) 

60.0 

33.8 

22.1 

17.0 

14.8 

13.7 

13.0 

12.8 

ISNRo<;p{dB) 

15.8 

14.5 

13.7 

13.3 

13.0 

12.8 

12.8 

12.8 

Table  6.4c  )  Py.„=l0"®,  P,;=0.5. 


M 

1 

2 

4 

8 

16 

32 

128 

OO 

ISNRsw  {dB) 

69.5 

42.7 

30.6 

25.4 

23.2 

22.2 

21.4 

21.1 

ISNRo<;j,(dB) 

24.2 

22.9 

22.1 

21.7 

21.4 

21.3 

21.2 

21.1 

Table  6. 4d)  Pya=10®,  P,;=0.9. 
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6.4  The  Natural  Robustness  of  Rank  Tests 

The  SW-CFAR  version  of  the  parametric  UMP  detector  for  narrowband  fading  sig¬ 
nals  in  Gaussian  noise,  Eq.  (2.8),  is  extremely  non-robust.  Its  effective  SNR,  which 
determines  the  detection  probability  according  to  (6.3.6)  or  (6.3.10),  is  inversely  propor¬ 
tional  to  the  total  noise  variance.  In  an  unrestricted  e-mixture  family, 
^  =  (1  -  e)  +  €.<7^  can  approach  infinity  for  very  spiky  noise,  hence  detectability  could 
be  completely  lost.  The  robust  tests  that  were  studied  in  Chapters  4  and  5  are  immune 
to  heavy  tailed  contaminations,  but  once  the  robust  estimator  was  designed  according  to 
some  k{e^),  if  the  actual  contamination  e  is  larger  and  e  — *■  1,  the  effective  SNR  ^  0. 
From  Eq.  (5.3),  if  all  the  contamination  lies  outside  [-k,  k],  we  obtain 


Y _ )-2kg  {k  )+k^e/{l-e) 


e-*l 


OO 


(6.4.1) 


However,  it  turns  out  that  the  various  rank  tests  inherently  possess  similar  immun¬ 
ity  against  heavy  tailed  noise,  and  in  fact,  the  detectability  loss  is  characterized  by  some 
power  of  (1  -  e).  This  is  seen  from  the  functional  that  influences  the  effective  SNR  and 
through  it  the  detection  probability,  G{J )  of  Eq.(6,3.12). 


Proposition  6.4  Assume  that  /(w)  is  monotonic  increasing,  and  denote  the  denomina¬ 
tor  of  (6.3.12)  by  J^.  Then,  if  the  noise  p.d.f.  belongs  to  a  mixture  family 
/  g=(l-€)/  Q+eh  ,  we  have: 

a)  If  /(•)  is  differentiable, 

)  >  (1  -  e)^[min  J'{u  )]2[/  /  ^  (a;  )dx  f  (6.4.2) 

U 

b)  If/(u)  is  piecewise  constant  and  discontinuous  at  u=ll2,  with  a  jump  A/(l/2), 

^  A/^(l/2)/^(0)  ^  (1  -e)^A/-(l/2)/o^(0) 


7^ 


G{JJ) 


(6.4.3) 
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Proof  Immediate  from  the  definition  and  the  assumptions,  using  /  ^  >  (1  -  e)^f  q  .  For 
Part  a,  from  the  variational  necessary  condition  for  a  minimum  of  G  with  the  con¬ 
straints  on  /  ,  it  can  be  shown  that  no  solution  for  it  exists  and  thus  the  inequality  in 
(6.4.2)  can  never  be  replaced  by  an  equality.  It  can  also  be  shown  that  as  the  spread  of 
the  contamination  increases,  G  approaches  its  lower  bound,  which  is  reached  with 
degenerate  contaminations  (e.g.,  h  =  U  [-a  , a],  a  — »■  oo  or  N(0,c^),  c  — ►  oo). 

Applying  the  proposition  for  the  various  score  functions,  for  a  Gauss-Gauss  mixture 
Fg(x  ;c  )—(l-€)(^(x  )-i-€(^(x  I  c),  we  obtain  after  solving  the  integrals 
i)  VDW  Scores  J^=  =  1,  min/'(-)  =  x/^. 


G{<l>  (u ),/  e)=2x 


/  ,{x  ;c  ))p}/  ^2  (2:  ;c  )dx 


> 


llzi)! 


(6.4.4) 


This  bound  is  not  tight;  for  small  e,  a  Taylor  expansion  yields 


G  (r'(» ).  f  =  1)1“  +  0  (£“(c  -  1)) 


(6.4.5) 


ii)  Wilcoxon  Scores,  7^  =  1/12,  /'(•)  =  1, 


G(«-l/2,/,)=i. 


TT 


(l-hC  2)^/2  c 


1 


>  -^(1-e)^ 

TT 


(6.4.6) 


iii)  Median  Scores  =  1,  /(O)  =  1,  using  part  b). 


G(sign(u  -  1/2),  /  J  =  4/  ^  (o)  =  A[i  _  g  +  i.]2  >  A  (i  _  e)2  (5  4 

TT  C  TT 


iv)  RVDW  Scores  The  robustified  VDW  scores  of  Eq.  (6.3.23),  designed  according  to 
where  is  the  design  e. 


G{RVDW,f  ,)= 


Stt 


(l-6,)2/(P*(e,)) 


/.-i 


1— 


(6.4.8) 

12 


|/  6^  (x;c  )dx 


l\]2 


>  L  0-JL.  iLzM-ll 
2  l-2cf>{-k) 
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where  x'  is  the  solution  of 

(1  -  ')  +  e<(>{x  '/c  )  =  (1  -  ),f>{k  )  +  ed/2  (6.4.9) 

Generally,  for  €>l-\/2/3=0.18  and  heavily  tailed  contaminations,  the  median  tests 
are  the  most  robust  (with  the  largest  loss  2/7r=-l.g6dB  at  the  Gaussian),  the  Wilcoxon 
tests  are  the  second  best  (with  merely  Z/tt  =  -0.2dB  loss  at  the  Gaussian),  and  the 
VDW  are  the  least  robust.  The  RVDW  is  intermediate  to  the  VDW  and  the  median 
tests. 

More  specific  conclusions  can  be  drawn  from  Tables  6. 5-6. 8,  which  exhibit  the  exact 
losses,  compared  to  the  UMP  for  Gaussian,  in  dB:  -101ogio(G  (/,/ J).  ((6.4.4)  and 
(6.4.8)  were  numerically  integrated).  Notice  that  the  performance  of  the  RVDW  tests  is 
not  identical  with  that  of  the  SSQME  tests,  as  the  functionals  G{J  J  )  and  l/V[l  J  )  of 
Eq.  (4.11)  are  different,  unless  both  J {■)  and  /(•)  are  the  optimal  for  /  .  However,  the 
difference  is  generally  small,  by  comparison  of  Tables  6.8  and  5.1. 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

0.20 

0.30 

0.35 

0.37 

0.37 

0.10 

0.20 

1.25 

1.75 

1.95 

2.00 

0.20 

0.20 

2.33 

3.44 

3.88 

4.01 

0.30 

0.20 

3.45 

5.27 

6.04 

6.28 

0.40 

0.20 

4.59 

7.26 

8.49 

8.87 

0.50 

0.20 

5.74 

9.43 

11.30 

11.91 

Table  6.5  losses  (dB)  for  Wilcoxon  detector,  /  =(l-e)V(0,  l)-feV(0,c-) 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

0.00 

0.22 

0.36 

0.41 

0.43 

0.10 

0.00 

1.67 

2.56 

2.89 

2.99 

0.20 

0.00 

3.01 

4.67 

5.34 

5.53 

0.30 

0.00 

4.26 

6.75 

7.83 

8.16 

0.40 

0.00 

5.43 

8.87 

10.48 

11.00 

0.50 

0.00 

6.53 

11.04 

13.40 

14.21 

Table  6.6  losses  (dB)  for  VDW  detector,  /  =(l-e)N(0,  l)+eA/’(0,c ^) 
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_ 

1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

1.96 

2.02 

2.04 

2.05 

2.05 

0.10 

1.96 

2.59 

2.78 

2.85 

2.87 

0.20 

1.96 

3.27 

3.68 

3.83 

3.88 

0.30 

1.96 

4.01 

4.69 

4.95 

5.02 

0.40 

1.96 

4.81 

5.84 

6.23 

6.34 

0.50 

1.96 

5.70 

7.15 

7.72 

7.90 

Table  6.7  losses  (dB)  for  median  detector,  /  =(l-e)A''(0,  l)+eN(0,c^) 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

0.06 

0.18 

0.24 

0.27 

0.27 

0.10 

0.06 

1.37 

2.16 

2.48 

2.57 

0.20 

0.06 

2.74 

4.33 

4.96 

5.14 

0.30 

0.06 

4.04 

6.45 

7.47 

7.78 

0.40 

0.06 

5.25 

8.59 

10.15 

10.64 

0.50 

0.06 

6.40 

10.81 

13.09 

13.86 

Table  6.8  Losses  (dB)  for  RVDW  rank  detector.  /  =(l-e)A'^(0,  l)+eA/'(0,  c^) 
a)  k==  1.945,  design  e=0.01,  G  {/  *)—-0.27  dB 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

0.44 

0.52 

0.55 

0.56 

0.56 

0.10 

0.44 

1.26 

1.57 

1.68 

1.72 

0.20 

0.44 

2.15 

2.89 

3.19 

3.28 

0.30 

0.44 

3.14 

4.52 

5.15 

5.35 

0.40 

0.44 

4.24 

6.60 

7.86 

8.25 

0.50 

0.44 

5.43 

9.07 

10.95 

11.57 

Table  6.86  k=  1.140  design  e=0.1,  G{f  *)=-!. 73  dB 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

0.77 

0.84 

0.86 

0.87 

0.87 

0.10 

0.77 

1.49 

1.72 

1.81 

1.84 

0.20 

0.77 

2.26 

2.81 

3.01 

3.07 

0.30 

0.77 

3.11 

4.07 

4.45 

4.56 

0.40 

0.77 

4.05 

5.57 

6.24 

6.46 

0.50 

0.77 

5.09 

7.45 

8.68 

9.11 

Table  6.8c  k=  0.862  design  e=0.2,  G  [f  *)=-3.11  dB 
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6\^C 

1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

1.05 

1.12 

1.14 

1.14 

1.15 

0.10 

1.05 

1.72 

1.94 

2.01 

2.03 

0.20 

1.05 

2.45 

2.92 

3.10 

3.15 

0.30 

1.05 

3.24 

4.05 

4.36 

4.45 

0.40 

1.05 

4.11 

5.36 

5.86 

6.01 

0.50 

1.05 

5.07 

6.92 

7.74 

8.00 

Table  6,8d  k=  0,685  design  6=0.3,  G{f  *)=-4,51  dB 


"6\^C 

1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

1.29 

1.35 

1.37 

1.38 

1.38 

0.10 

1.29 

1.94 

2.14 

2.21 

2.23 

0.20 

1.29 

2.64 

3.08 

3.24 

3.29 

0.30 

1.29 

3.40 

4.15 

4.43 

4.51 

0.40 

1.29 

4.24 

5.37 

5.81 

5.93 

0.50 

1.29 

5.16 

6.80 

7.47 

7.68 

Table  6.8e  k=  0.555  design  e=0.4,  G[f  '')=-6.02  dB 


c 

1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

1.51 

1.57 

1.59 

1.59 

1.60 

0.10 

1.51 

2.15 

2.34 

2.41 

2.43 

0.20 

1.51 

2.84 

3.26 

3.42 

3.46 

0.30 

1.51 

3.58 

4.30 

4.56 

4.64 

0.40 

1.51 

4.40 

5.47 

5.88 

6.01 

0.50 

1.51 

5.31 

6.84 

7.46 

7.64 

Table  6.8/  k=  0.436  design  6=0.5,  G  [f  *)=-7.73  dB 


1.00 

3.33 

10.00 

33.3 

100.00 

0.01 

1.73 

1.80 

1.81 

1.82 

1.82 

0.10 

1.73 

2.37 

2.56 

2.62 

2.64 

0.20 

1.73 

3.05 

3.46 

3.61 

3.66 

0.30 

1.73 

3.79 

4.48 

4.73 

4.80 

0.40 

1.73 

4.60 

5.63 

6.02 

6.13 

0.50 

1.73 

5.49 

6.95 

7.52 

7.71 

Table  6.8^  k=  0.291  design  e=0.65,  G[f  *)=-10.96  dB 
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6.5  Asymptotically  Maximin  Rank  Tests 

The  effective  SNR  of  the  optimal  rank  tests  of  Section  6.3  were  shown  to  be  pro¬ 
portional  to  /(/  ).  Therefore,  the  least  favorable  p.d.f.  of  Eq.  (4.13)  is  a  candidate  for  a 
maximin  solution.  Notice,  however,  that  G{J J  )  of  Eq.  (6.3.12)  is  not  a  convex  func¬ 
tional,  and  the  maximin  property  is  therefore  not  a  consequence  of  the  minimax  property 
for  M-estimators.  Huber  treated  rank  estimators  for  location  in  [3].  It  turns  out  that 
their  variance  is  precisely  l/G[J  J  ).  Thus  the  minimax  property  of  the  estimation  vari¬ 
ance  is  translated  to  a  maximin  relation  on  the  detection  probability.  As  the  proof  in  [3] 
is  not  detailed,  we  complete  it  here. 

Proposition  6.5.1  Let  Pj  =  {/  =  (1  -  e)^  -|-  eh  },  where  g  is  known,  symmetric  and 
strongly  unimodal,  and  h  an  arbitrary  symmetric  p.d.f.  Denote  hy  f  ^  —  {1  -  e)g  +  eh  q 


the  p.d.f. 

which  minimize  /(/  ) 

in  Pe, 

Eq. 

(4.13).  Let 

®/v(0  ^  J o(w  )= 

-/o'(V(« 

))//o(^o-'(t^)).  Then, 

a)  (Huber) 

O 

o 

lA 

G{JoJ) 

\if 

€Pe 

(6.5.1) 

b)  For  arbitrary  positive  r.v.  A  in  (6.3.1)^ 

«/  )=E^EAf 

U  ,^)  >  I3{f  o) 

V/  ep, 

(6.5.2) 

i.e.,  combining  with  Propositions  6.3.1  and  6.3.4,  the  various  one-  and  two-sample  rank 
tests  are  asymptotically  maximin  robust. 

Proof  a)  Since  the  denominator  of  (6.3.12)  is  not  a  function  of  /  ,  it  suffices  to  show 

1  1 

/  Jo'{u  )/  o(Fo-'  {u  ))du  <  f  Jo'(u  )/  (F-\u  ))du 

0  0 

By  the  assumptions  of  symmetry  and  strong  unimodality,  /'  >0  and  both  integrands 

are  symmetric  around  u  =1/2.  Also,  Jo'(u)  —  0  outside  /«(£)= 
{u  G  [1/2,  Fo(^  (^))]}!  from  Eq.  (4.13).  Thus,  it  suffices  to  show 
/  o(-^o'^  (^  ))^  /  ))  Since  /jq=0  on  this  interval. 
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f  oiF-\u))<f  {F-\u)).  But  on  4  (e)  ^  {x  6  [0.^- (e)]}.  Fo{x)<F{xl  and 
•^o(O)  —  =  1/2-  Thus,  on  /„  ,  Fo{F~^{u))  <  F{F~\u))  =  u  ,  and  taking  F on 

both  sides,  0  <  F~\u  )  <  F {u)  <  k  . 

Finally,  by  virtue  of  the  strong  unimodality,  /  0(2:)  is  decreasing  on  [0,oo).  Com¬ 
bining  all  these  inequalities, 

fo{Fo^{u))  <  fo{F^\u))  <f{F-\u)) 

on  4(€),  Q.E.D. 

b)  Immediate  from  (6.3,10),  as  /?(/  |  A  )  is  monotonic  increasing  in  G[JqJ  ). 

Corollary  6.5.2  The  rank  tests  designed  according  to  the  RVDW  scores,  Eq.  (6.3.23), 
are  asymptotically  maximin  robust  within  a  mixture  family  with  g  —  A^(0,1), 

Remarks  1)  The  maximin  is  valid  only  for  symmetric  /  .  The  two-sample  and  the 
one-sample  tests  with  zero  mean  signal,  Eq.  (6.3.17),  remain  DF-CFAR  for  all  /  .  2)  For 
the  single-sweep  tests.  Proposition  6,5.1  is  valid  if  also  M  ^  oo,  since  the  functional  in 
Eq.(F.lO)  of  App.F  converges  to  G[J  J  ). 

6.6  Robust  rank  detectors  for  unknown  signal  frequency 

When  the  signal  frequency  is  unknown,  due  to  Doppler  shift  of  the  reflected  signal 
from  a  target  of  unknown  velocity  in  a  radar/sonar  situation,  the  situation  is  similar  to 
that  of  Chap.  5.3.  In  this  chapter,  two  different  ways  of  constructing  Doppler  bank 
detectors  are  investigated,  corresponding  to  the  different  ranking  structures.  We  show 
that  the  the  effective  SNR  is  given  by  the  same  expression  as  before,  multiplied  by  a  loss 
factor  for  unmatched  frequency.  Hence,  all  previous  conclusions  about  maximin  optimal¬ 
ity  remain  valid. 
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6.6.1  One  -  and  two  -  sample  rank  Doppler  detectors. 

The  distribution  of  the  observations  is  given  by  Eq.(6.3.1),  where  i9— *•/?,■ 


A  I  <0.5 


(6.6.1) 


and  /,  is  the  sampling  frequency.  As  with  the  SSQME  -  Doppler  test,  we  take 
f={k/n)fr  ,k=l,..,n,  and  construct  n  frequency  channels  by  the  “butterfly” 
transformation 


li  {k)  =  Re  [(xi  +jyi )  ,  g.  =  Im  [(a:,-  +  jy,- ) 


(6.6.2) 


where  =exp(-y  27r/n ).  Thus,  A  is  the  deviation  of  the  signal  frequency,  from  the 
center  of  the  nearest  filter,  normalized  by  the  filter  width  f  ^ /n  .  Notice  the  “butterfly” 
transformation  is  performed  only  on  the  test  cell,  but  not  on  the  noise  reference  observa¬ 
tions.  For  each  of  the  frequency  channels,  the  transformed  observations  are  ranked 
according  to 


=  rank  of  /,•  in  {/j, . ,  /„  ,  . ,  } 


(6.6.3) 


R^i  =  rank  of  Q,-  in  {Q  . ,  g„  ,  . ,  y^} 

where  the  k  (frequency)  index  is  dropped  for  simplicity  of  notation.  The  test  statistics 
for  each  of  the  frequency  channels  are  of  the  form  T  where 


(6.6.4) 


and  {s;  }  are  the  known  signal  modulations,  Eq.(6.3.2).  The  asymptotic  performance  of 
this  detector  structure  is  given  by  the  following. 

Proposition  6.6.1  a)  In  the  expression  for  the  efficacy,  Eq.(6.3.11),  the  first  term 
G  (s,  s)  is  replaced  by 


—  E  i^i-S /(M+l))s,-exp(y27ri  A/n  ) 
n 


2 


G  (s,  s.  A)  =  lim 


i  =1 


(6.6.5) 
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b)  For  a  zero  mean  signal,  5=0, 

G  (s,  s.  A)  =  lim 


1  ” 

—  Yj  2:n-z  A/n  ) 

^  t  =1 


(6.6.6) 


which  is  identical  to  the  loss  factor  of  the  quadrature  matched  filter  detector  (DFT),  for 
a  time  varying  signal  with  unmatched  frequency.  In  particular,  when  s,- =±1 

G(s,  s.  A)  =  [sin(7rA)/7rA]^  (6.6.7) 

c)  For  two  -  sample  rank  Doppler  tests,  with  s,  — 1, 


G{s,  s.  A)  =  -^_[sin(7rA)/7rA]2 


(6.6.8) 


The  proof  of  a)  is  given  in  Appendix  I;  b)  and  c)  are  straightforward  consequences. 
Note  that  a  stronger  statement  about  the  asymptotic  would  be  desired.  However, 
we  could  not  prove  or  disprove  that  it  is  given  by  substituting  the  modified  efficacy  into 
(6.3.10).  This  was  true  if  /  and  Q  of  (6.6.4)  were  shown  to  be  asymptotically  uncorre¬ 
lated,  see  App.  I.  Notice  that  /,•  is  uncorrelated  with  ,  but  in  general  for  non  -  Gaus¬ 
sian  {JQ ,  Yj* }  they  are  not  independent.  The  difficulty  is  in  finding  a  manageable  expres¬ 
sion  for  R^^=m  },  which  is  needed  to  compute  E{IQ).  Due  to  the  ranking 

operation,  and  R^j  are  not  i.i.d.,  even  though  and  Y j  are.  However,  simulation 
results  indicated  the  stronger  statement  is  probably  valid,  besides  being  valid  for  Gaus¬ 
sian  noise. 


6,6.2  Single  -  sweep  rank  Doppler  detectors 

Though  a  Doppler  -  bank  of  detectors  could  be  constructed  in  the  previous  manner 
with  identical  results,  for  single  -  sweep  ranking  a  simpler  solution  in  terms  of  implemen¬ 
tation  complexity  exists.  It  turns  out  that  it  is  possible  to  interchange  the  order  of  rank¬ 
ing  operation  with  the  frequency  down  -  conversion,  without  any  degradation  in  the  per¬ 
formance  !  Specifically,  let 
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R{M+i]i  =  rank  of  x.-q  in  [x^q,  x^,  . ,  } 


(6.6.9) 


>RfM+i).-  =  rank  of  2/,-o  in  {y.-Q,  2/ib  . ,  Vm} 

A  complex  vector  of  the  scored  single  -  sweep  ranks  is  generated, 

Xi  =  Ui  +  jvi  ,  Ui  =  aM+i{RfM+i)i)  ,  Vi  =  aM+i{R(M+i)i)  (6.6.10) 


and  the  test  statistic  for  the  A:'*  frequency  channel  is  taken  as  the  DFT 


(6.6.11) 


Notice  that  since  only  a  single  rank  at  each  sweep  (the  index  i)  is  needed,  the 
ranking  complexity  is  linear  -  0{M),  as  the  f'*  rank  can  be  computed  from 


M 


R(M+I)i=  S  {^n(^)}  can  be  computed  by  a  FFT  algorithm,  and  with  the 


k  =0 


same  simplifying  assumptions  of  Chap.  5.3,  the  complexity  of  the  FFT  part  of  the  detec¬ 
tor  is  O  (5n  log2n -t-4n  ).  Combining,  the  total  complexity  is  O  (2nM+5n  log2n  4-4n  ). 
This  is  in  sharp  contrast  with  the  complexity  of  the  previous  structures,  which  is 
O  (2n^log2n -M0n^-|-4n )  for  one  -  sample  ranking  (with  QUICKSORT  algorithm),  and 
even  more  for  the  two  -  sample  detector.  In  App.  I,  we  prove  the  following  result  on  the 
asymptotic  performance: 

Proposition  6.6.2  For  na^—^c^,  the  limiting  detection  probability  of  the  test  statistic 


(6.6.11)  is 


cAC^/^(ajVf+i,  /  )  I  sin7rA/7rA  I  ,  \/-2\oga  (6.6.12) 


where  Q  [•,  •]  is  Marcum’s  Q  -  function  and 


M+l 


(6.6.13) 


As  a  consequence,  all  previous  conclusions  on  maximin  optimality  remain  valid. 
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Why  does  the  DFT  after  ranking  scheme  work  in  this  situation  ?  The  key  factor  is 
the  independence  of  the  {R[M+i)i}i  each  observation  is  ranked  only  with  respect  to 
its  adjacent  noise  reference  samples.  Thus,  when  the  SNR  is  large,  when  signal  of  fre¬ 
quency  k  is  present,  the  {u,  }  and  {u,-}  oscillate,  with  a  period  corresponding  to  A: .  Of 
course,  all  higher  harmonics  of  k  are  present,  due  to  the  nonlinear  operation  a  (R,- (xj-g))- 
The  essence  of  the  above  result  is  asymptotic  linearization,  i.e.,  the  first  harmonic  is  the 
dominant  term,  and  therefore  the  same  loss  factor  of  the  DFT  detector  is  obtained.  The 
same  can  not  be  true  for  the  other  ranking  structures,  due  to  the  strong  dependency  in 
the  vector  of  ranks. 
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6.7  Finite  sample  simulation  results 

6.7.1  The  cases  studied 

The  performance  under  the  hypothesis  and  the  alternative  was  investigated  by 
means  of  Monte-Carlo  (MC)  simulations,  for  various  quadrature  rank  tests,  under 
different  noise  situations. 

One-  and  two-sample  tests  have  been  simulated,  with  the  WilcoxonCVVX),  normal 
scores  according  to  Van  der  Waerden(VDW)  version,  and  the  robustified  VDW  (RVDW) 
score  functions  (see  Sections  6. 3. 2-6. 3. 4.).  In  the  following,  the  notation  VDW(n,M)  will 
designate  a  VDW  test,  with  n  main  samples  and  nM  noise-reference  samples.  For  the 
RVDW  test,  a  third  number  in  the  brackets  will  indicate  the  factor  k  at  which  the  score 
function  is  limited,  see  Eq.(6.3.23).  M  =  0  will  designate  a  one-sample  test  with  a  zero 
mean  known  signal  sequence,  according  to  Eq.(6.3.17).  One-sample  tests  that  are  based 
in  addition  on  the  signs  of  the  observations,  were  not  studied.  As  discussed  before,  they 
are  inferior  under  situations  of  non-symmetric  noise  p.d.f.  At  this  time,  we  do  not  yet 
have  results  for  single-sweep  rank  quadrature  tests. 

For  the  detection  performance,  8  different  Gauss-Gauss  noise  mixtures 
(1-€)V(0,  1)  +  eN{0,  c^)  have  been  simulated.  They  will  be  denoted  according  to 
Table  6.7.1.  As  is  evident  from  the  table,  all  contaminated  cases  represent  large,  heavy 
tailed  deviations  from  the  nominal  assumption,  with  the  total  noise  power  increased  by 
10  to  20  dB. 


Ci 

total  noise  variance  in  [dBl 

/l 

0.0 

0 

/  2 

.01 

30.04 

10 

fz 

.1 

9.54 

10 

/4 

.1 

31.48 

20 

/  5 

.2 

6.78 

10 

/  6 

.2 

22.27 

20 

/  7 

.4 

4.85 

10 

Xa_ 

.4 

15.76 

20 

Table  6.7.1  The  studied  noise  p.d.f.’s. 
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6.7.2  Performance  under  the  null  hypothesis 

Under  the  hypothesis,  as  long  as  all  the  observations  are  i.i.d.,  the  various  tests  stu¬ 
died  are  distribution  free.  Therefore,  a  major  simplification  and  saving  in  execution  time 
was  possible  -  instead  of  generating  the  original  observables  and  ranking  them,  the  ran¬ 
dom  rank  vector  was  generated  directly,  in  the  following  manner.  An  integer  in  [l,...,  n  ] 
was  drawn  with  a  uniform  probability  l/N ,  stored  as  i?  i,  and  deleted  from  the  original 
group  of  integers.  At  the  second  stage,  i?2  was  drawn  from  the  reduced  group,  indepen¬ 
dently  of  Ry  with  a  uniform  probability  l/[N-l),  and  so  on.  For  the  two-sample  tests, 
only  n  «  N—n{M-\-l)  ranks  were  generated,  as  the  other  ranks  are  not  required  for 
computation  of  the  test  statistic.  In  this  way,  the  probability  distribution  of  the  rank 
vector  is  as  required  by  (6.2.2).  From  the  vector  of  ranks,  the  various  quadrature  test 
statistics  were  generated  according  to  (6.3.4)  or  (6.3.17),  with  the  previously  mentioned 
score  functions.  The  known  signal  sequence  was  s,  =l  for  two  -  sample  tests,  and  s,'=l 
or  -1  (equal  number),  according  to  (6.3.17)  for  one  -  sample  tests. 

About  45  cpu  minutes  were  required  on  the  IBM/3081,  with  N —M,  2T0®  repeti¬ 
tions  of  the  Monte  -  Carlo  experiment,  where  all  3  detectors  (WX,  VDW,  RVDW)  simu¬ 
lated  simultaneously  on  the  same  vector  of  ranks.  On  the  DEC-VAX/750,  about  35  cpu 
hours  were  required  for  the  same  simulation  size 

Figures  6. 7. 1-6. 7. 3  exhibit  -log(F^(,)  vs.  the  threshold  multiplier  t,  for  tests  of  the 
H 

form  where  is  the  quadrature  statistic  on  the  left  hand  side  of 

^0 

either  Eq.(6.3.4)  or  (6.3.17).  The  RVDW  curves  are  with  k  =  1.14.  £'o(^Ar)  is  calcu¬ 
lated  from  (6.2.11).  In  this  way,  a  convenient  universal  comparison  of  Pf^  for  different 

^  A  unique  variant  of  the  Importance  -  Sampling  technique,  suitable  for  the  class  of  detectors  stu¬ 
died  here,  was  also  developed.  Although  in  one  case  it  enabled  us  to  obtain  reliable  thresholds  down 
to  Pfa=lO~^  with  only  20,000  repetitions,  a  100  fold  saving,  many  trial  and  error  recursions  were 
required  until  a  satisfactory  distorted  probability  measure  was  found  to  generate  smooth  estimates 
of  Pfa-  The  distorted  measure  depended  on  the  sample  size  and  score  function,  thus  large  human 
effort  was  needed  to  cover  all  the  desired  cases.  Since  computer  resources  have  been  readily  avail¬ 
able,  most  of  the  thresholds  were  obtained  from  the  simpler  "brute  -  force”  simulation. 
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sample  sizes  is  possible,  and  convergence  to  the  asymptotic  result  is  measured  by  the  dis¬ 
tance  of  the  Monte-  Carlo  curves  from  the  straight  line.  Explicitly,  the  normalizing  fac¬ 
tor  £'o(rjv)  is  given  by; 

2n‘^M{N +1)/12  -  WX,  two  -  sample 

2n  ^(n -l-l)/12  -  WX,  one  -  sample 

2^nM/{M+l)  -  VDW  and  RVDW,  two  -  sample 

2a^n  -  VDW  and  RVDW,  one  -  sample 

N 

where  a/v=(l/W)^  «iv(0-  Numerical  values  of  ^  are  given  in  Tables  6.1  and  6.3  of 

t=i 

Section  6.3.3. 

The  figures  show  that  with  all  score  functions,  for  (n=16,  M=0)  there  is  a  large 
deviation  from  the  asymptotic  prediction,  when  <10“^.  This  is  not  surprising,  since 
the  rank  statistics  are  bounded  r.v.’s.  For  this  case,  it  can  be  calculated  that 
max{  /F'o(  Tn )}  is  a  number  between  11  to  13  for  the  various  score  functions.  Asymp¬ 
totically,  t  (Fy  „  =10  ®)— 13.8,  thus  the  asymptotic  theory  can  not  predict  correctly  for 
this  low  level.  The  cases  (32,2)  and  (64,0)  essentially  converge,  while  (32,0)  and  (16,2)  are 
in  between.  With  respect  to  the  score  function,  RVDW  converges  faster,  though  the 
difference  is  small.  By  way  of  comparison,  the  SW-SSQME  robust  test  with  n  =  16 
essentially  converged  to  the  asymptotic  result,  see  Chapter  5.4.  This  is  so  because  the  M- 
or  L-  estimators  are  “closer”  to  sum  of  i.i.d.  r.v.’s  than  the  rank  statistic. 

Thresholds  for  the  F,;  simulation  were  taken  from  these  graphs,  with  some  smooth¬ 
ing  for  F ya=10”®.  In  the  simulation,  the  sensitivity  to  deviation  in  t  was  measured. 
As  result  of  the  smoothing,  we  estimate  the  accuracy  of  the  simulation  within 
±0.5dB  for  Fy„=10“®  (i.e.,  the  true  SNR  required  to  achieve  given  P^  is  within  this 
range  ),  and  essentially  error  free  for  Fy^  >10~^. 


Wilcoxon  typG  quadraturG  rank  dGtGctors.  Nr=2*10**6 
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Fig. 6. 7.1  False  alarm  probability,  Wilcoxon  quadrature  rank  tests. 
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Fig. 6. 7. 2  False  alarm  probability,  VDW  quadrature  rank  tests. 


^  9  s  r  e  z  1  0 


C  ojd  JOOI- 


THRESHOLD 


Robustified  VDW  quadratuPG  detector.  Nr=2*10**6 


-  151 


Fig. 6. 7. 3  False  alarm  probability,  RVDW  quadrature  rank  tests. 
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6.7.3  Probability  of  detection  performance 

The  simulations  for  detection  probability  implement  directly  the  model  (6.3.1)  and 
some  of  the  various  detectors  of  Sections  6.3,  6.5  and  6.6.  Independent  in-phase  and  (qua¬ 
drature  noise  r.v.’s  with  distributions  /  ^—f  g  are  generated  by  a  switching  device,  as  in 
section  5.4.  The  in-phase  and  quadrature  components  of  a  Rayleigh  signal  with  uniform 
phase  are  added  to  the  noise,  two  rank  vector  are  generated  from  the  observations,  and 
the  test  statistics  are  computed  and  compared  to  the  previously  found  thresholds.  For 
most  of  the  results  presented  here,  the  signal  frequency  is  assumed  known  and  the 
“butterfly”  transformation,  Eq.(6.6.2),  is  not  performed.  Figs.  6.7.21-6.7.23  correspond  to 
the  Doppler  rank  detectors  of  Section  6.6.1. 

In  a  typical  run,  the  3  different  detectors  (WX,  VDW,  RVDW)  of  equal  (n,M)  are 
simulated  on  the  same  observations,  at  3  Pf^  levels  (10“^,  10“^,  10“®),  and  at  20  different 
SNR  values.  On  the  IBM/3081,  about  4  cpu  minutes  were  required,  for  n=32  and  M=0, 
with  4  different  noise  distributions  and  2500  Monte-Carlo  (MC)  repetitions. 

In  all  the  following  figures,  is  plotted  vs.  the  nominal  integrated  SNR  na^/2, 
where  a^/2  is  the  input  SNR  at  the  nominal  component  of  the  mixture.  Thus,  the 
actual  input  SNR  is  lower  by  a  factor  of  l-e+ec^.  This  normalization  enables  us  to 
display  all  the  results  on  the  same  range,  and  it  will  be  seen  that  this  is  the  dominant 
factor  which  determine  the  detectability,  even  for  heavily  tailed  contaminations  (recall 
the  results  of  Section  6.4).  In  all  figures,  the  curves  marked  (UMP,e=0)  correspond  to 
the  UMP  detector  for  narrowband  slow  fading  Rayleigh  signal  in  narrowband  Gaussian 
noise  of  known  variance;  those  marked  (theory,  /  *(e))  are  computed  from  the  asymp¬ 
totic  Pi  at  the  least-favorable  p.d.f.  If  no  value  of  e  is  indicated,  e=0.1 

In  order  to  restrict  the  number  of  displayed  results,  we  shall  concentrate  on  the 
RVDW  detector.  Its  performance  as  function  of  the  sample  and  reference  sizes  will  be 
shown,  as  well  as  the  dependency  on  the  noise  distribution.  For  some  cases,  comparison 
with  the  other  detector  laws  (WX,  RVDW)  will  be  given. 
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Fig.  6.7.4  display  curves  of  RVDW(k=1.14),  with  various  (n,M),  against  Gaus¬ 
sian  noise.  In  general,  good  agreement  with  the  asymptotic  result  will  be  achieved  if  the 
curves  are  shifted  to  the  right,  from  the  (UMP,e=0)  curve,  by  the  dB  amount 
corresponding  to  the  efficacy  loss,  independent  of  P^.  It  is  seen  that  at  all 

curves  essentially  converge  in  shape;  the  loss  for  (64,0)  is  0.5-0. 8  dB,  compared  to  the 
efficacy  loss  of  0.44  dB,  see  Tbl.  6.8b.  Also,  the  one-sample  (16,0)  detector  is  better  than 
the  two-sample  (16,2)  by  about  1.5  dB,  in  agreement  with  M /{M+l)=  1.8  dB  (see  Eq. 
6.3.29).  On  the  other  hand,  as  expected,  deviation  from  asymptotics  is  larger  at 
F’/a=10'®.  Here,  the  distribution  of  RVDW(16,0,1.14)  is  very  different  in  shape  from  the 
asymptotic  result,  Eq.  (6.3.6).  For  the  other  sample  sizes,  the  shape  of  the  asymptotic 
distribution  is  virtually  obtained,  with  somewhat  increased  dB  losses.  In  this  case,  the 
(16,2)  detector  is  substantially  better  than  the  (16,0)  detector  (about  9  dB  at  P^=0.8), 
contrary  to  the  asymptotic  result.  The  dependency  of  the  RVDW  detector  on  sample  size 
at  the  /  3  distribution  (see  Table  6.7.1)  is  shown  in  Fig.  6.7.5.  Here,  agreement  with  the 
asymptotics  is  worse.  For  the  other  distributions,  all  the  large  e  cases  (/  3-/  g)  behaved 
similar  to  Fig.  6.7.5,  while  /  2  behaved  very  much  like  the  Gaussian.  Similar  qualitative 
behavior  was  found  with  the  VDW  and  WX  detectors.  In  fact,  our  version  of  the  one- 
sample  WX  detector,  behaves  very  much  like  Hansen’s  narrowband  detector  [40]  which 
utilizes  also  the  signs  of  the  observations. 

Roughly  speaking,  at  Fya=10“®,  the  performance  agrees  with  the  asymptotic  pred¬ 
iction,  with  somewhat  increased  SNR  losses,  for  n  >32  and  small  e,  for  all  score  func¬ 
tions.  For  larger  contaminations,  the  breakpoint  at  Py3=10“®  is  around  n  =32.  At 
P fa  =10“^,  the  (16,0)  detectors  also  come  close  to  the  asymptotic  prediction,  if  e<0.01. 

The  RVDW(16,0,1.14)  performance  at  Fya=10“'*  against  /  i-f  4  noises  is  shown  in 
Fig.  6.7.6.  Notice  that  the  Pj  degradation  is  ranked  according  to  the  total  noise  vari¬ 
ance,  and  for  the  noises  with  equal  variance  (/  2)  /  3)  -Prf  is  lower  when  e  is  larger,  as 
could  be  expected.  We  found  it  possible  to  improve  somewhat  the  performance  against 
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the  larger  contaminations  by  either  of  the  following  two  modifications.  In  the  first,  by 
taking  the  RVDW(0.862)  score  function  (it  is  asymptotically  maximin  robust  when 
e=0.2)  -  i.e.,  by  introducing  harder  limiter  on  the  score  function.  In  the  second,  we  took 
a(l)— a(16)=0  -  i.e.,  the  influence  of  the  extreme  ranks  was  totally  eliminated.  As  could 
be  expected,  this  improvement  also  resulted  in  higher  losses  at  the  nominal  and  /  2  dis¬ 
tributions.  Comparison  between  the  different  detector  laws  for  the  above  parameters  at 
/  4  is  shown  in  Fig.  6.7.7.  At  Prf=0.5,  RVDW  is  better  than  WX  by  1  dB,  which  in  turn 
is  better  than  VDW  by  1.3  dB.  These  differences  are  even  somewhat  larger  than  the 
asymptotic  ones,  compare  Table  6.8b  with  Tables  6.5  and  6.6.  Very  similar  superiority  of 
RVDW  was  obtained  for  /  3,  with  the  difference  negligible  for  /  2-  Notice  that  the  VDW 
detector,  which  is  the  asymptotically  optimal  rank  detector  for  Gaussian  noise,  is  quite 
robust  anyway,  in  agreement  with  the  discussion  of  Section  6.4.  This  is  in  sharp  contrast 
with  the  total  breakdown  of  the  parametric  LMP  detector  under  non  Gaussian  heavy 
tailed  noise. 

The  performance  of  RVDW(32,0,1.14)  for  all  levels  against  /  4-/  4  is  shown  in 
Figs.  6. 7. 8-9.  Since  convergence  to  the  asymptotics  is  almost  reached  at  Py^=lO“^,  the 
curves  are  almost  bounded  by  the  theoretical  for  the  least-favorable  distribution.  The 
detection  performance  is  worse  at  Pfa  ^  10~^.  though  the  detector  is  clearly  robust.  In 
Fig.  6.7.8  we  also  demonstrate  that  the  Gaussian  noise  provides  the  most  difficult  detec¬ 
tion  environment,  among  noises  with  equal  variance.  This  is  demonstrated  by  the  dashed 
rightmost  curves,  which  correspond  to  the  UMP  detector  against  narrowband  Gaussian 
noise,  with  variance  equal  to  that  of  /  2-/  4. 

Fig.  6.7.10  corresponds  to  the  two-sample  RVDW(32,2,1.14)  detector,  where  it  is 
seen  that  convergence  to  the  asymptotics  is  essentially  reached  also  for  P j ^  =10“®.  At 
-R/a=10~^  the  MG  losses  are  2.4  dB  for  /  4-/  2,  compared  to  2.3  and  2.4  dB,  respec¬ 
tively,  from  Table  6.8b  (adding  the  1.8  db  loss  appropriate  to  M/(M-t-l)).  For  73-/4, 
the  MC  loss  is  3.4  dB,  and  is  identical  with  the  efficacy  loss.  Notice  that  at  Pj^  =10“® 
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the  two-sample  detector  is  better  than  the  equivalent  one-sample  detector,  Fig.  6.7.9, 
which  contradicts  the  asymptotic  prediction.  A  possible  explanation  is  that  the  two- 
sample  detector,  which  use  more  observations,  almost  converges  to  the  asymptotics  at 
this  low  level,  while  the  one-sample  does  not. 

Comparison  of  the  different  detector  laws  with  (32,0)  at  /  3  noise  is  shown  in  Fig. 
6.7.11,  and  against  the  f  ^  noise  in  Fig.  6.7.12.  Again,  the  quality  ranking  is  in  accor¬ 
dance  to  asymptotic  results,  and  the  WX  detector  is  only  slightly  inferior  to  the  RVDW. 
For  the  large  contamination  of  /  4,  RVDW  is  superior  to  VDW  by  about  6  dB  at 
P^=0.7. 

Fig.  6.7.13  depicts  of  RVDW(32,0,.862),  which  is  asymptotically  maximin  robust 
for  £=0.2,  against  /  1,  /  4-/  5  noises.  Recall  £=0.2  for  /  5-/  g.  While  the  theoretical 
curve  for  the  least  favorable  /  *(0.2)  does  not  bound  the  MC  curves  at  the  highly  con¬ 
taminated  noises,  the  performance  can  be  considered  satisfactory  if  the  large  increase  in 
the  input  noise  variance  (10-20  dB)  is  accounted  for.  Comparison  between  different 
detector  laws  at  (32,0, 10““*)  is  shown  in  Figs.  6.7.14-6.7.16.  At  the  Gaussian,  as  predicted 
by  Tables  6.5  and  6.6,  WX  and  VDW  are  essentially  identical,  and  superior  by  about  0.8 
dB  over  RVDW(0.862),  an  excellent  agreement  with  Table  6.8c.  For  /  5-/  g,  however, 
RVDW  is  better  than  VDW  by  about  2.5  and  7  dB,  respectively,  at  =0.7.  WX’s  per¬ 
formance  is  in  between. 

Fig.  6.7.17  corresponds  to  RVDW(64,0,1.14)  against  /  4-/  4.  Comparing  it  with 
RVDW(32,0,1.14),  Fig.  6.7.9,  we  observe  an  improvement  in  convergence  at  Pyg=10“®. 
Now,  all  P^  curves  agree  in  shape,  though  the  losses  in  /  3-/  4  are  still  higher  than  the 
efficacy  losses. 

Fig.  6.7.18  shows  of  RVDW(64,0,.55),  which  is  maximin  robust  for  £=0.4.  The 
loss  at  the  Gaussian  is  increased  by  merely  0.5  dB  when  compared  to  the 
RVDW(64,0,1.14)  detector,  even  less  than  the  efficacy  loss  of  Table  6.8e,  but  now  the 
robustified  detector  is  seen  to  handle  reasonably  well  the  noises  /  7-/  g,  with  £=0.4.  The 
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large  advantage  of  RVDW  over  VDW  at  the  heavily  tailed  noise  /  g,  about  10  dB  at 
Fi=0.6,  is  clearly  observed  in  Fig.  6.7.20.  Comparison  of  Fig.  6.7.19  with  6.7.20  also 
reveals  that  the  heavier  tailed  noise  has  a  rather  limited  influence  on  RVDW,  but  it 
causes  a  higher  degradation  in  WX  and  VDW,  in  accordance  with  the  asymptotic  results; 
compare  Tables  6.5,  6.6  with  6.8e. 

The  performance  of  the  one-sample  rank  Doppler  RVDW(32,0,1.14)  detector.  Sec¬ 
tion  6.6.1,  is  exhibited  in  Figs.  6.7.21-6.7.23.  Fig.  6.7.21  corresponds  to  a  matched  fre¬ 
quency  case,  A=0.  Comparing  it  with  Fig.  6.2.9,  we  observe  slightly  lower  at  the 
Gaussian  and  /  2  noises,  but  some  improvement  for  the  heavier  tailed  /  3-/  4  noises. 
Similar  behavior  was  found  with  the  WX  and  VDW  detectors.  These  empirical  results 
support  the  conjecture  following  Proposition  6.6.1.  The  improved  performance  with 
higher  contaminations  can  be  heuristically  explained  by  the  following:  and  (Q,- ,  which 

are  weighted  sums  of  two  r.v.’s,  have  the  same  variance  as  Xi  and  y, ,  but  their  margi¬ 
nals  have  shorter  tails  as  consequences  of  the  convolution. 

The  frequency  mismatch  cases  are  shown  in  Fig.  6.7.22  (A==0.25)  and  Fig.  6.7.23 
(A=0.5).  The  latter  corresponds  to  the  crossover  frequency  between  adjacent  “filters”  in 
the  Doppler-bank  detector.  Notice  that  the  theoretical  curves  are  shifted  to  the  right 
from  those  of  Fig.  6.7.21  by  (sin7rA/7rA)^=0.9dB  and  3.9  dB,  respectively.  Thus,  they 
correspond  to  the  linear  DFT  detector  and  to  the  asymptotic  distribution  of  the  rank 
Doppler  RVDW  detector  (against  /  *),  under  identical  frequency  mismatch.  At 
Fya=10“^,  the  MC  curves  are  essentially  shifted  by  the  appropriate  amount  from  those 
in  Fig.  6.7.21,  validating  Prop.  6.6.1.  At  10“®,  where  convergence  to  the  asymptot¬ 

ics  is  not  yet  reached,  the  losses  are  somewhat  larger  than  the  above  values.  Neverthe¬ 
less,  as  before,  the  RVDW  detector  was  found  superior  over  the  WX  and  VDW  at  all 
heavy  tailed  noises  by  roughly  the  same  amount  as  in  previous  cases. 
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6.7.4  Comparison  with  the  robust  SSQME  detectors 

The  lack  of  convergence  of  the  RVDW(16,0)  detector  to  the  asymptotic  prediction 
(recall  Figs.  6. 7. 4-6. 7. 7)  might  seem  to  imply  that  it  is  inferior  to  the  various  robust 
SSQME  detectors.  However,  a  fair  comparison  must  be  based  on  the  same  number  of 
noise  reference  vectors.  Fig.  6.7.24  shows  RVDW(16,4,1.14),  and  Fig.  6.7.25  corresponds 
to  the  SW-SSQME  detector  with  4  reference  vectors  and  q;=0.3  (it  is  Fig.  5.17  repeated). 
Notice  that  the  /  4  noises  are  not  the  same  in  the  two  figures.  It  is  seen  that  RVDW  is 
superior  to  SW-SSQME,  by  about  2  dB  at  P^^=10~2  and  8  dB  at  ^^^=10’®.  These 
differences  are  in  good  agreement  with  the  asymptotic  prediction  of  Tables  6.4a-d:  with 
M=4,  the  theoretical  SNR  difference  is  1.7  dB  for  (Py„  =10'’“,  0.5<P^  <0.9),  8.4  dB  for 
{P /a  “10  =0.5),  and  8.7  dB  for  (Py^  =10“®,  =0.9).  Recall  from  Section  6.3.4 

that  they  have  different  limiting  distributions,  hence  the  SNR  loss  is  function  of  P^.  A 
detailed  comparison  of  the  integrated  SNR  {na‘^/2),  required  for  given  Py^  and  Pj,  is 
shown  in  Table  6.7.2.  We  thus  conclude  the  RVDW  is  superior  to  the  SW-SSQME. 


Pfa 

p<i 

detector  type 

/i 

/  2 

fz 

0.5 

RVDW(16,4,1.14) 

8.9 

9.1 

10.1 

10“2 

SW-SSQME(16,4,0.3) 

10.8 

10.9 

11.4 

0.9 

RVDW(16,4,1.14) 

17.8 

17.8 

19.3 

SW-SSQME(16,4,0.3) 

19.8 

19.8 

20.2 

0.5 

RVDW(16,4,1.14) 

14.6 

14.8 

16.3 

10“® 

SW-SSQME(16,4,0.3) 

22.6 

22.8 

23.3 

0.9 

RVDW(16,4,1.14) 

21.1 

21.3 

25.9 

SW-SSQME(16,4,0.3) 

31.2 

31.2 

31.8 

Tbl.  6.7.2  Comparison  of  the  ISNR,  na~/2  in  dB,  between  SW-SSQME 

and  RVDW  detectors. 
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Comparison  between  the  SSQME  test  without  reference  samples,  Fig.  5.20,  and  the 
RVDW(16,0,1.14)  is  less  conclusive.  At  against  f  i-f  2  noises,  the  perfor¬ 

mance  is  roughly  the  same.  For  /  3  at  this  level,  the  SSQME  detector  is  better,  where 
the  SNR  loss  increases  with  P^.  At  Py.^=10‘®,  the  P^  curves  are  totally  different.  For 
/  1  and  /  2,  the  curves  cross  at  about  P^  =0.4,  where  the  SSQME  is  better  for  higher 
values  and  the  difference  increases  with  P^.  At  f  3,  SSQME  is  substantially  better  for  all 
>0.05.  At  Pfa=10r^,  the  situation  is  even  more  complicated,  but  in  general,  for 
Pd  >0-9)  the  one-sample  SSQME  is  always  better  and  thus  should  be  preferred,  at  least 
for  the  relatively  restricted  contaminations  against  which  it  was  simulated  (e<0.1). 
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Fig.  6.7.4  RVDW  test,  Gaussian  noise,  convergence  with  n  and  M . 


u.uu  S.OO  10.00  15.00  20.00  25.00  30.00 

INTEGRATED  SNR  IN  DB 


-  160  - 


Fig.  8.7.6  IS-RVDW  test,  /  i  to  /  4  noises,  n  =16. 
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Fig.  8.7.7  Comparison  of  lS-\\%  VDW  and  R\T)W  tests,  /  4  mixture  noise,  n  =16. 


c 

o 


-  161  - 


Fig.  6.7.8  1S-R\TD\V  test,  /  i  to  f  ^  noises,  n  =32. 
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Fig.  6.7.9  IS-RATDW  test,  /  i  to  /  4  noises,  n  =32. 
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Fig.  6.7.10  2S-RVDW  test,  /  i  to  /  4  noises,  n  =32,  M  =2 
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Fig.  6.7.11  Comparison  of  \T)W  and  R\TD\V  tests,  /  3  mixture  noise,  n  =32. 
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Fig*  6.7.12  Comparison  of  ISAVX,  \T)W  and  RVDW  tests,  /  4  mixture  noise,  n  =32. 
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Fig.  6.7.13  IS-RVDW  test,  /  j  and  /  4“  /  e  noises,  n  =32,  k  =0.862. 
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Fig.  6.7.14  Comparison  of  IS-WOC,  VDW  and  RVDW  tests,  Gaussian  noise,  7i  ^32. 


,D0  5. DO  10. OD  15.00  20.00  25.00 

INTEGRATED  NOMINAL  SNR  IN  DB 


30.00 


.15  Comparison  of  VDW  and  RVDW  tests,  /  5  mixture  noise,  r?  =32. 


F  (5)  NOISE  :  j 

NlSIGNClL  SflMPLESI=32 
N^M  REF .  SAMPLES, M=  0 
NR-  2500 
PFA-  I  0*^^  (-4) 


^  J 


,00  5.00  10.00  15.00  20.00  25.00 

INTEGRATED  NOMINAL  SNR  IN  DB 


30.  00 


-  165  - 


Fig.  6.7.16  Comparison  of  IS-WOC,  VDW  and  R\T)W  tests,  /  g  mixture  noise,  n  =32. 
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Fig.  6.7.17  1S-RM)W  test,  /  j  to  /  4  noises,  n  =64. 
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Fig.  6.7.18  IS-RVDW  test,  /  i;  /  4,  /  7  and  /  g  noises,  n  =64. 
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Fig.  6.7.20  Comparison  of  IS-WX,  VDW  and  RVDW  tests,  /  g  mixture  noise,  n  =64. 
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Fig.  6.7.22  IS-RVDW  Doppler  test,  /  ^  to  /  4  noises,  n  =32,  A/  =0.25. 
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Fig.  6,7,23  1S-R\T)W  Doppler  test,  /  i  to  /  4  noises,  n  =32,  A /  =0.50. 
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Fig.  6.7.24  2S-RVDW  test,  /  i  to  /  4  noises,  n  =16,  M=4 
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Fig.  6.7.25  SW- a  trimmed  test,  /  i  to  /  4  noises,  n  =16,  M  =4 


c 

c 


-  170  - 


6.8  Overview  and  Comparisons  of  Different  Robust  Detectors 

In  Chapters  4  through  6,  different  algorithms  for  robust  detection  of  fading  nar¬ 
rowband  signals  have  been  developed.  They  vary  in  implementation,  complexity  and  per¬ 
formance;  thus,  a  comparison  arnong  them  is  appropriate. 

Clearly,  the  SSQME  test  of  Chap.  4  is  the  least  attractive,  and  should  be  used  only 
when  the  variance  of  the  nominal  noise  is  stationary  and  known  precisely.  Even  then, 
the  slightly  more  complicated  sliding-window  (SW)  version  of  Section  5.2  will  perform 
better  for  values  around  0.5. 

With  the  other  tests,  none  is  uniformly  best,  and  the  choice  would  depend,  mostly, 
on  the  environmental  conditions.  The  major  factors  that  have  to  be  considered  are  the 
availability  of  noise  reference  samples  (or  the  validity  of  the  assumption  that  the  target 
and  reference  samples  are  identically  distributed  under  and  how  reliable  is  the 

assumption  of  independent  samples.  Since  even  under  i.i.d.  conditions  the  exact  prefor- 
mance  depends  on  many  factors  [n  ,  M,  the  desired  Pf^  and  P^,  the  maximum  e),  only 
qualitative  conclusions  will  be  drawn  here. 

Complexity 

Implementation  of  the  SW  or  one-sample  (IS)  tests  with  M-estimators  require,  in 
general,  more  operations  than  with  L-  or  R-  statistics.  The  complexity  is  roughly  the 
same  for  the  SW  and  the  IS  tests  (either  with  the  R-  statistics  or  with  the  MAD  scale 
estimator.  Section  5.2),  if  the  need  for  two  orderings  of  the  data  in  the  IS-SSQME  is  not 
counted.  The  two-sample  (2S)  R-test  requires  considerably  more  computations,  and  is 
more  sensitive  than  the  IS  R-  (in  an  homogeneous  environment)  only  when  n  <32.  Sub¬ 
stantial  reduction  of  complexity  is  possible  with  the  single-sweep  (SS)  R-  Doppler  test 
(Section  6.6.2),  which  is  the  only  one  that  preserves  the  O  {n  logrz )  of  the  conventional 
periodogram  (FFT)  test.  (Its  small  sample  performance,  though,  has  not  been  simulated 
yet). 
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Compatibility  with  non-homogeneous  environment. 

The  analysis  in  this  thesis  has  been  restricted  to  the  cases  where  the  target  and 
reference  samples  are  identically  distributed.  But,  heuristic  assessment  can  be  made  for 
spatial  non-stationarity.  In  radar  systems,  the  major  sources  for  it  are  the  following:  a) 
The  noise  power  varies  as  a  function  of  the  type  of  terrain  from  which  the  clutter  echoes 
are  reflected.  This  factor  is  less  significant  if  the  terrain  is  fairly  uniform  (e.g.,  non  urban 
areas)  and  the  range  extent  of  the  reference  window  is  relatively  short  (say,  0.5  km). 
High  range  resolution  (say,  50  meters),  and  reasonably  small  M  (say,  M=10)  are 
sufficient.  Moreover,  in  any  reasonable  coherent  radar  processing,  clutter  cancellation 
(i.e.,  whitening  by  means  of  notch  filtering  the  clutter  spectral  peak)  is  performed  prior 
to  other  operations.  Properly  designed,  the  clutter  residues  are  at  about  the  thermal  and 
quantization  noise  levels,  and  consequently,  the  influence  of  non-stationarity  is  reduced, 
b)  “Edges”.  Sharp  increases  in  the  clutter  levels  are  encountered  on  the  boundary  of 
different  terrains  (e.g.,  when  mountain  ranges  touch  the  seashore  line).  The  major  prob¬ 
lem  with  edges,  is  that  they  cause  a  false  alarm  whenever  the  window  sweeps  over  the 
edge.  Again,  clutter  prewhitening  reduces  this  phenomena,  c)  An  echo  from  another  tar¬ 
get  within  the  reference  window,  is  common  in  dense  target  situations.  Closely  spaced 
targets  (range  separation  smaller  than  M)  cause  a  capture  phenomena;  even  when  the 
SNR  is  very  large  for  both  targets,  and  they  are  roughly  equal,  is  often  limited  to 
very  small  values. 

The  various  IS  tests  are  clearly  immune  to  any  non-homogeneity,  as  no  reference  is 
used.  The  sensitivity  of  the  IS  R-  test  is  lower  by  several  dB’s  than  that  of  the  test  with 
cv— trimmed  and  MAD  estimators  when  n  <C32  and  e^O.l.  However,  this  disadvantage  is 
compensated  by  the  DF-CFAR  operation  of  the  first  for  all  sample  sizes,  and  even  for 
nonsymmetrical  noise  density.  Moreover,  for  very  large  contaminations  (say,  e>0.3)  and 
small  n  ,  the  R-  based  IS  test  has  better  sensitivity.  (Recall,  however,  that  phase  modu¬ 
lation  of  the  transmitted  pulse  train  is  required  for  the  R-  test). 
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When  is  the  SW  test  with  a-trimmed  estimator  better?  Its  sensitivity  in  a  homo¬ 
geneous  environment  is  considerably  better  (for  n  =16,  P^^O.Q)  provided 

M^6.  With  this  window  size,  however,  it  becomes  more  susceptible  in  a  non- 
homogeneous  environment  to  the  above  mentioned  phenomena.  When  the  type  a)  non¬ 
homogeneity  is  not  a  serious  problem,  it  is  relatively  simple  to  protect  the  SW  test 
against  the  undesired  effects  of  the  b)  and  c)  types.  Since  the  behavior  of  the  robustified 
SW  test  in  a  contaminated  noise  has  been  shown  to  be  very  similar  to  that  of  the  linear 
SW  test  in  Gaussian  noise  (where  the  only  difference  is  the  modification  of  the  effective 
SNR),  even  for  small  sample  sizes,  the  same  techniques  that  have  been  applied  before  are 
appropriate  [19-20].  The  cure  against  edges  is  to  modify  the  adaptive  threshold 
v)  in  Eq.  (5.5)  such  that  the  sample  averages  of  the  leading  and  the  trailing  halfs 
of  the  window  are  computed,  and  then  the  greatest  of  the  two  is  taken  as  the  adaptive 
threshold,  see  [19|.  The  capture  problem  caused  by  closely  separated  targets  is  treated 
by  censoring  from  the  reference  window  the  cell  of  largest  magnitude  (or  the  m  largest 
cells  if  m  targets  are  expected  to  be  within  the  window).  Censoring  and  “Greatest-of” 
detection  are  easily  combined  to  protect  against  both  types  b)  and  c)  of  non¬ 
homogeneity,  see  [20].  (The  threshold  multiplier  t{M)  is  different  in  the  combined  algo¬ 
rithm,  but  it  is  possible  to  compute  it  with  the  techniques  of  [20]). 

Detection  in  dependent  noise 

Throughout  the  thesis,  the  noise  samples  have  been  assumed  to  be  statistically 
independent.  Recently,  several  papers  have  treated  robust  detection  for  serially  depen¬ 
dent  noise  samples.  Poor  [75],  following  Portnoy’s[74]  work  on  robust  estimation  with 
dependent  noise,  considered  weak  dependency  resulting  from  a  first  order  moving  aver- 
age  model,  with  parameter  p  ►  0.  His  robust  M-detector  utilizes  a  linearly  corrected  ver¬ 
sion  of  Huber’s  soft  limiter,  and  is  capable  of  restoring  the  O  [p]  sensitivity  loss  which 
the  usual  M-detector  suffers  in  the  assumed  moving  average  noise.  Moustakides  and  Tho¬ 
mas  [76]  looked  at  a  more  general  structure  for  dependency,  namely  at  (^-mixing  noise 
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sequences.  They  have  proved  a  minimax  result  on  the  efficacy,  and  an  upper  bound  on 
P fa  >  class  of  limiter-correlator  detectors,  and  for  ^-mixing  noise  with  an  e-mix- 

ture  marginal  distribution.  Their  test  utilizes  a  null-  zone  modification  of  Huber’s  limiter. 
Although  these  results  are  theoretically  important  and  similar  structures  could  be  sought 
for  the  signal  models  in  this  thesis,  it  can  be  argued  that  this  would  not  be  the  best 
direction  to  proceed. 

First,  when  dependency  (in  radar  detection)  is  a  problem,  it  usually  is  a  strong 
dependency  ^  It  is  well  known  that  the  optimal  detector  for  dependent  Gaussian  noise 
first  prewhitens  the  noise,  and  then  correlates  it  with  the  known  signal.  Therefore,  the 
above  robust  tests  which  are  all  based  on  memoryless  operations,  are  far  from  being  the 
globally  optimal  structures.  (It  is  easy  to  demonstrate  that  a  memoryless  detection  when 
the  noise  is  Gaussian  but  strongly  correlated,  can  cause  huge  degradation).  Second,  the 
modifications  on  the  non-linearity  require  exact  knowledge  of  some  parameters  in  the 
dependency  structure;  in  particular,  the  mixing  parameters  {7„  }  in  [76,  Eq.  (4)]  are  very 
difficult  to  determine.  This  precise  knowledge  seems  to  contradict  the  spirit  and  purpose 
of  robust  statistical  processing. 

On  an  intuitive  ba^is,  it  seems  that  the  natural  way  to  attack  the  problem  is  first 
to  try  to  prewhiten  the  input  noise  (utilizing  adaptive  techniques  of  modern  spectral  esti¬ 
mation  [78]  if  the  autocorrelation  is  unknown),  and  then  to  apply  one  of  the  robust  tests 

^  In  radar  detection,  when  the  noise  is  dominated  by  ground  or  precipitation  clutter, 
there  is  a  very  strong  dependency  among  the  input  samples.  The  power  spectrum  (of  the 
discrete  autocorrelation)  is  usually  composed  of  a  relatively  flat  pedestal,  representing  the 
thermal  noise  and  clutter  echoes  the  are  received  through  the  antenna  sidelobes,  and  a 
very  high  and  narrow  peak,  that  corresponds  to  the  clutter  reflections  through  the  anten¬ 
na  mainbeam.  (When  weather  and  other  aerial  clutter  are  present,  they  produce  another 
high  and  somewhat  wider  spectral  peak,  that  is  shifted  by  the  average  Doppler  frequency 
of  the  producing  clouds  from  the  ground  velocity).  Prewhitening  of  this  colored  spectra 
type  is  pretty  simple  in  the  ground  clutter  case;  the  center  frequency  of  the  clutter  peak 
is  tracked  by  phase  lock  loops,  and  its  width  is  essentially  known  as  it  is  related  to  the 
antennas  beamwidth  and  its  scan  frequency.  Thus,  prewhitening  is  done  by  notch-filtering 
(“MTF  in  radar  terminology).  When  aerial  clutter  is  also  present,  it  can  be  whitened  by 
cascading  the  ground  clutter  canceller  with  another  phase  lock  loop/notch  filter  combina¬ 
tion.  Alternatively,  modern  adaptive  techniques  of  spectral  estimation  can  be  used  to  esti¬ 
mate  the  spectrum  and  to  control  the  parameters  of  the  prewhitening  filter  [78]. 
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for  independent  noise.  Since  prewhitening  is  a  reversible  operation,  an  optimal  procedure 
following  it  would  not  involve  any  loss,  see  [14,  Part  I,  p.  289],  Although  white  but  non- 
Gaussian  noise  is  not  necessarily  independent,  this  approach  might  be  acceptable.  (For 
estimation  of  a  location  parameter,  Martin  [77]  has  recently  suggested  that  this  is  the 
“proper”  approach  to  achieve  efficiency  under  dependence;  he  has  also  pointed  that  this 
is  the  proper  way  to  design  robust  tests). 

Now,  there  remains  the  question  which  of  the  structures  that  have  been  studied  in 
this  thesis  is  more  appropriate.  The  prime  objective  is  to  achieve  the  DF-CFAR  perfor¬ 
mance,  for  any  dependence  scheme.  Under  certain  conditions,  the  robust  M-  and  L- 
estimates  as  well  as  the  R-statistics  are  expected  to  be  asymptotically  Gaussian  distri¬ 
buted,  but  with  a  variance  different  from  the  i.i.d.  case.  The  variance  will  depend  (in 
addition  to  other  factors)  on  the  dependence  structure.  Proof  for  M-estimates  with  either 
a  first  order  autoregression  or  a  first  order  moving  average  scheme  was  given  by  Portnoy 
[74],  see  also  [77].  Wolff  et  al.  proved  it  for  linear  test  statistics  with  any  finite  order 
autoregression,  see  [79]  and  the  other  references  given  there. 

From  all  the  normalization  structures  studied  in  this  thesis,  the  SW  class  of  tests 
will  assure  DF-CFAR  as  long  as  the  in-phase  and  quadrature  statistics  are  asymptoti¬ 
cally  Gaussian.  Notice  that  the  various  R-tests  are  not  CFAR  anymore  under  depen¬ 
dence,  but  a  SW  version  of  the  IS  R-test  is  easy  to  construct.  Simply,  the  fixed  thres¬ 
hold  in  the  IS  version  of  Eq.(6.3.4)  is  replaced  by  the  sample  mean,  taken  over  the  refer¬ 
ence  window,  of  the  same  quadrature  statistic  which  is  computed  for  the  cell  under  test. 
‘Studentizing”  the  2S  or  SS  rank  tests  is  also  possible,  following  the  ideas  in  [79],  but 
demands  a  substantial  increase  in  the  complexity.  (Basically,  an  estimator  of  the  vari¬ 
ance  of  the  “main”  test  statistic  is  needed  to  set  the  threshold).  Since  even  the  i.i.d.  2S- 
R  test  is  more  complex  than  the  IS,  and  is  more  sensitive  only  for  quite  small  sample 
sizes,  it  seems  that  the  IS  is  a  better  choice,  especially  under  dependence.  Sacrificing  the 
computation  efficiency  of  the  SS  R-  Doppler  test,  however,  is  a  disadvantage. 
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Although  a  deeper  and  thorough  analysis  of  the  suggested  scheme  is  beyond  the 
scope  of  this  work,  a  simulation  of  a  single  example  was  carried  to  demonstrate  the  vali¬ 
dity.  We  assumed  that  following  the  prewhitening  the  noise  can  be  modeled  as  a  first 
order  moving  average  process  as  in  [75],  where  the  i.i.d.  samples  from  which  the  moving 
average  noise  is  derived  have  an  e-  mixture  distribution.  The  moving  average  parameter 
was  /7=0.1,  and  the  test  was  the  SW  (n=16,  Af=4)  with  a— trimmed  estimators.  It  was 
found  that  the  tails  of  the  distribution  of  the  test  statistic  under  Hq  were  somewhat 
longer;  however,  increasing  the  trimming  ratio  to  q;=0.4  kept  the  Pf^  very  close  to  the 
asymptotic  prediction.  (See  Fig.  6.7.26.)  Actually,  the  curves  are  very  similar  to  Fig.  5.9. 
The  Pi  curves  also  turned  out  to  be  very  similar  to  those  of  Figs.  5.17-5.19,  with  the 
same  insensitivity  to  the  marginal  distribution;  see  Fig.  6.7.27.  Here,  the  dashed  curves 
correspond  to  the  computed  P^  of  the  UMP  test  when  the  noise  is  a  MA  Gaussian  with 
p=0.1;  the  other  are  for  MA  noises  with  the  /  i-  /  4  generating  distributions  of  Section 
5.4. 

Naturally,  the  dependence  causes  a  sensitivity  loss  compared  to  the  i.i.d.  case. 
This,  however,  is  roughly  identical  with  that  suifered  by  the  optimal  detector  for  Gaus¬ 
sian  noise  with  the  same  correlation  matrix.  The  latter  can  be  shown  to  multiply  the 
SNR,  A  ^/2,  by  instead  of  by  n  ,  where  R  is  the  correlation  matrix.  For  the 

»>y 

assumed  model,  this  loss  is,  with  p=0.l  and  n=16,  13.6/16=0.7  dB,  and  the  dashed 
curves  in  Fig.  6.7.27  are  shifted  by  this  amount  from  the  e=0  curves  of  Fig.  5.19.  For 
the  longer  tailed  distributions  f  2~  f  3>  there  is  an  additional  sensitivity  loss  of  about  0.8 
-  1  dB.  It  is  mostly  attributed  to  the  increased  estimator’s  variance  when  a  is  raised 


from  0.3  to  0.4. 
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Fig.  6.7.26  of  a  SW-  a-trimmed  test,  under  first  order  MA  dependence. 


O 
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Fig.  6.7.27  of  a  SW-  a-trimmed  test,  under  first  order  MA  dependence. 
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7.  ROBUST  DIGITAL  M-ARY  COMMUNICATION  RECEIVERS 
7.1  Introduction 

It  appears  that  optimal  digital  communication  theory  is  still  mostly  a  theory  of 
additive  Gaussian  noise.  A  survey  of  both  classical  and  modern  texts,  cf.  [65-68],  does 
not  reveal  any  significant  results  for  non-Gaussian  noises.  This  is  true  for  both  aspects  of 
the  communication  problem:  optimal  demodulation  and  optimal  signal  selection.  On  the 
one  hand,  optimal  digital  demodulation  is  analytically  tractable  for  arbitrary  compli¬ 
cated  signal  sets  received  in  additive  colored  Gaussian  process.  Moreover,  explicit 
expressions  or  tight  bounds  on  the  error  probability  exist,  and  it  is  therefore  possible  to 
find  optimal  signal  sets.  On  the  other  hand,  information  theory  supplies  us  with  a  closed 
form  expression  for  the  channel  capacity  of  the  additive  white  Gaussian  noise  channel 
(AWGN),  against  which  the  efficiency  of  any  signaling  scheme  can  be  compared. 

The  tractability  of  both  aspects  of  the  problem  is  attributed  to  the  remarkable  pro- 

b 

perty  of  a  Gaussian  random  process  n{t):  any  integral  of  the  form  f  f  [t)n{t)dt  is  a 

a 

Gaussian  random  variable.  Hence,  infinite  dimensional  decision  problems  are  made 
countable  by  such  techniques  as  the  Karhunen-Loe’ve  expansion,  projection  on  the  basis 
set  spanned  by  the  signal  set  to  obtain  sufficient  statistics,  or  whitening  augmented  by  a 
reversibility  argument.  Non-Gaussian  processes,  on  the  other  hand,  pose  formidable 
difficulties.  For  the  optimal  detection  part  of  the  problem,  the  receiver  should  maximize 
the  posterior  probability  that  a  signal  s„,  {t )  has  been  transmitted.  However,  for  cases  of 
prime  interest  we  do  not  have  close  form  expressions  for  the  posteriors.  Consequently, 
the  optimal  signal  set  for  given  constraints  can  not  be  found.  While  numerical  algo¬ 
rithms  for  computation  of  the  channel  capacity  of  continuous  channels  and  source  alpha¬ 
bets  have  been  derived  [67],  we  do  not  know  of  any  closed  form  expressions  for  non- 
Gaussian  cases.  Hence,  there  is  no  baseline  against  which  sub-optimal  receivers  could  be 
compared. 


-  178  - 


As  in  binary  hypothesis  testing,  we  would  expect  that  various  communication 
environments  consist  of  non-Gaussian  noises.  These  include,  to  name  a  few:  RFI  or  jam¬ 
ming  in  radio  communication,  other  non-synchronized  messages  in  multiple-access  chan¬ 
nels,  under-water  communication,  etc.  We  thus  might  wonder  why,  in  contrast  to  binary 
hypothesis  problems,  robust  digital  communication  has  not  been  studied  so  far.  One 
explanation  is  the  previously  mentioned  analytical  difficulties.  The  second  is  that, 
perhaps,  optimal  communication  is  not  the  focus  in  practice.  For  instance,  SNR’s  of  70 
dB  are  typical  in  the  American  telephone  network.  Yet  another  explanation  arises  from 
information  theory  -  for  fixed  signal  energy  and  additive  noise  power,  the  capacity  is  the 
lowest  for  the  AWGN  channel  -  see  Theorem  7.4.3.  in  [66].  Hence,  if  someone  compares 
the  performance  of  a  given  receiver  only  against  the  input  SNR  he  would  not  be 
alarmed.  As  a  consequence  of  Shannon’s  coding  theorem,  however,  we  know  he  could 
have  done  better  had  he  utilized  some  knowledge  on  the  non-Gaussian  distribution  (this 
will  be  demonstrated  in  the  sequel).  Finally,  some  ad-hoc  devices  in  the  receiver,  which 
usually  are  not  designed  as  an  outcome  of  statistical  analysis  (e.g.,  squelch  circuitry  in 
FM  receivers,  Dolby  in  audio  tape  decks,  etc.)  might  combat  successfully  some  of  the 
non-Gaussian  interferences. 

In  this  chapter  we  attempt  a  modest  first  step  towards  robust  digital  communica¬ 
tion  for  additive,  white  non-Gaussian  (AWNGN)  channels.  To  circumvent  the  analytical 
difficulties  we  study  only  the  discrete  time  problem,  and  we  limit  the  scope  to  the  sim¬ 
plest  modification  of  the  discrete  time  white  Gaussian  process,  i.e.,  we  modify  only  the 
first  order  distribution.  Within  this  framework,  we  show  that  there  is  a  close  relation¬ 
ship  to  the  problem  of  robust  estimation  of  a  location  parameter  by  means  of  M- 
estimators.  Further,  by  keeping  the  structure  of  the  bank  of  correlators  receiver,  but 
replacing  the  correlators  by  limiter-correlators,  a  minimax  error-probability  is  obtained 
when  the  limiter  is  Huber’s  soft  limiter  function.  This  is  true  for  an  arbitrary  signal  set. 
Similar  results  are  obtained  for  noncoherent  communication,  where  the  receiver  is 
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robustified  by  inserting  limiter-correlators  in  the  in-phase  and  quadrature  channels  of  the 
quadrature  square-law  receiver. 

When  it  is  desired  to  reduce  the  sample  size,  we  propose  a  similar  structure  with 
generalized  M-estimators  replacing  the  limiter-correlators.  Specific  important  cases 
(PAM,  PSK  and  QAM)  yield  identical  asymptotic  results.  The  same  is  shown  true  for 
the  orthogonal  and  uniform-distance  signal  sets,  provided  we  replace  the  one-dimensional 
M-estimators  by  the  solutions  of  an  equivalent  robust  regression  problem.  Based  on  our 
Monte-Carlo  experience  with  the  binary  problems  in  previous  chapters,  we  suspect  that 
the  latter  implementation  is  superior  to  the  first. 

7.2  Robust  M-Ary  Coherent  Receivers:  Limiter- Correlator  Implementation 

In  the  M-ary  digital  communication  problem,  at  each  time  interval  ^G[0,  T],  the 
transmitter  sends  one  of  the  signals  ASjn{t)j  corresponding  to  one  of  equally  probable  M 
message  symbols  m  6{0,1,...,M}.  The  signals  are  characterized  by  their  energies  6,-  ^  and 
the  normalized  inner  products  p^j 

T 

Jsi{t)sj{t)dt  =  Ty/^Pij  (7.1) 

0 

where  —  1  and  |  p^j  |  <1.  Except  for  pulse  amplitude  modulation  (PAM),  we 
assume  6,*  =£V^ . 

The  coherently  received  signal  at  the  output  of  an  AWNGN  waveform  channel  is 

r{t)  =  As^{t)  +  w{t)  (7.2) 

where  ti^(0  is  a  band-limited,  zero-mean,  wide-sense  stationary  (w.s.s.)  random  process, 
not  necessarily  Gaussian,  with  spectral  density  S^(f  )  =  CNq/2  for  [  /  |  <B  and  0 
otherwise.  We  consider  the  “nominal”  power  spectral  density  Nq/2  as  known,  but  C  >0 
as  unknown.  In  particular,  we  will  be  interested  in  heavily  tailed  marginal  noise  distribu- 

^  In  our  notation,  5,(f )  and  6,  are  dimensionless.  The  physical  energy  is  thus  given  by  A'^Te^. 
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tions,  for  which  C  »1  {C=l  for  Gaussian  ).  We  will  take  n=2BT  »1;  this  ensures 
that  the  received  signal  is  essentially  undistorted  as  well  as  that  the  noise  process 
approaches  a  white  noise  in  the  limit. 

In  order  to  avoid  some  very  tough  analytical  difficulties  that  are  inherent  to  any 
detection  problem  with  continuous  time,  non-Gaussian  processes,  we  will  limit  our  study 
only  to  discrete  time  processing 

The  received  waveform  is  time  sampled  at  the  Nyquist  rate  2B  =1/ At  ^  yielding 
ri=r{iAt),  Wi=w{iAt)  and  s^i=s^{{  At),  i  =2BT .  Obviously,  the  noise 

samples  are  uncorrelated.  We  make  the  following  crucial  assumption  on  the  class  of  noise 
processes  {w{t)}  to  be  considered  hereafter: 


If  the  {wi}  are  uncorrrelated  ,  they  are  statistically  independent  (p) 

It  is  justified  to  ask  if  the  postulated  class  contain  any  other  process  beside  the  Gaus¬ 
sian.  This  question  has  been  positively  answered  by  Martin  and  Schwartz  [69].  They 
have  constructed  the  so  called  quasi-mixture  Gaussian  process  (QMGP),  which  possess 
the  above  property  if  the  quasi-mixture  is  generated  from  two  independent  white  Gaus¬ 
sian  processes;  the  marginal  distributions  are  simply  e-mixtures  of  two  Gaussian 
p.d.f.’s.  ^  While  the  definition  and  construction  of  the  QMGP  might  seem  somewhat 
artificial  and  non  physical,  any  other  definition  of  a  w.s.s.  non-Gaussian  process  that 
does  not  obey  (p)  is  bound  to  involve  formidable  analytical  difficulties.  Hence,  even 

though  a  more  realistic  characterization  might  be  desired,  the  one  we  have  postulated 

”2 - ; - 

Admittedly,  there  are  two  major  drawbacks,  a)  In  some  cases,  it  is  simpler  and  more  economical 
to  implement  the  digital  communication  receiver  with  analog  processing,  e.g.,  the  matched  filter  in 
a  PSK  receiver  is  commonly  implemented  by  multiplying  (with  diodes)  the  input  signal  by  a 
sinusoid  at  the  carrier  frequency,  and  then  integrating  the  product  by  a  capacitor-operational 
amplifier  circuit,  b)  Even  when  the  number  of  samples  is  very  large,  the  performance  might  be  in¬ 
ferior  to  that  achieved  by  the  optimal  analog  processing.  Although  this  is  not  the  case  with  Gaus¬ 
sian  noise  and  the  bank  of  matched  filters  receiver,  we  can  not  claim  it  in  general  without  a  deeper 
analysis. 

While  it  does  not  seem  possible  to  realize  a  continuous  QMGP  exactly,  arbitrary  close  approxi¬ 
mations  can  be  realized  by  switching  between  two  independent  Gaussian  processes,  where  the 
switching  rule  is  based  on  a  nearly  white  Poisson  process.  At  each  time  increment  of  the  Poisson 
process,  the  first  Gaussian  process  is  selected  with  probability  e,  see  [69]. 


-  181  - 


will  serve  at  least  as  a  modest  first  step  towards  robustification  of  digital  receivers 
against  deviations  from  the  predominant  AWGN  channel  assumption. 

Without  loss  of  generality,  the  receiver  first  normalizes  the  input  by  the  square  root 
of  the  “nominal”  power:  =r,’ /(NqS We  now  have  a  multiple-hypothesis  detec¬ 

tion  problem,  where  on  the  m  hypothesis 

Hm  •  ^ni  =  ^n^mi  +  ;  2=1,..,,,^  m  =1,...,,M  (7.3) 

Here,  a^=A  /[N and  the  n,*  are  i.i.d.,  zero-mean,  with  variance  C 
and  a  symmetric  p.d.f.  f  .  As  B  and  n  -^oo,  (7.1)  is  replaced  by 

hm  —  X;  Smi  sii  =  \/e^  Cl  pi^  (7.4) 

n  — >oo  U 

We  restrict  our  attention  to  a  class  of  decision  rules  which  capture  the  structure  of 
the  (discrete  time)  optimal  receiver  for  the  AWGN  channel,  but  we  replace  the  linear 
correlation  by  a  nonlinear  one.  Namely,  we  decide  in  favor  of  if  =  max  , 

1  " 

S  ^  )^mi  ^  l,...,Af  ('^•^) 

^  j=l 

and  /(•)  satisfies  the  same  regularity  conditions  as  in  Chapter  4.  ^  Heuristically,  it  seems 
plausible  that  if  /  (•)  is  some  sort  of  a  limiter,  the  effect  of  heavy  tailed  noise  will  be 
reduced,  but  some  performance  loss  will  be  incurred  if  the  noise  is  Gaussian.  We  will 
show  that  if  the  marginal  noise  p.d.f.  /  belongs  to  a  class  of  distributions  for  which  a 
minimax  M-estimator  of  location  exists,  then  a  minimax  result  on  the  error  probability  is 
obtained  within  this  class  of  decision  rules. 

^  Notice  that  for  any  noise  process^  the  optimal  Bayesian  receiver  computes  posterior  probabilities 
and  decides  in  favor  of  the  largest,  but  we  can  not  claim  that  the  decision  variables  of  Eq.  (7.5)  are 
proportional  to  a  monotonic  function  of  the  posteriors  of  any  noise  process,  even  in  discrete  time 
and  asymptotically.  However,  if  we  measure  the  performance  only  with  respect  to  the  total  input 
SNR,  it  will  be  seen  that  the  performance  is  better  than  in  the  Gaussian  case.  Thus,  the  sub- 
optimal  class  which  requires  only  slight  increase  in  the  implementation  complexity,  is  quite  ade¬ 
quate. 
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We  now  turn  to  the  asymptotic  performance  analysis.  When  n  — »-oo,  the  [d^]  are 
jointly  Gaussian  distributed;  this  can  be  shown  for  bounded  /(•)  by  invoking 
Liapounoff’s  version  of  the  central  limit  theorem  (notice  that  the  summands  are  func¬ 
tions  of  n,  hence  the  simple  CLT  does  not  hold),  see  [64,  p  201],  For  linear  /(•),  the 
proof  is  elementary.  Hence,  computation  of  means  and  covariances  is  sufficient  for  the 
error  probability.  With  tedious  but  straightforward  computation  it  can  be  shown  that: 

lim  E(n  \H,)=A  (I  (7.6) 

n  -*oo  i V  0 

lim  Cov{n^l^d^,  n^l^di  \  Hk)  =  Ef  (7.7) 

n  -*oo 

lim  Var  \  H^)  —  E  t  (/')6  ^  (7.8) 

n  -*oo 

Since  the  signal  amplitude  is  always  0(72“^/^),  the  above  expressions  are  obtained  by 
means  of  a  Taylor  series  expansion.  These  expressions  are  similar  to  the  corresponding 
ones  for  the  matched- filter/Gaussian  noise  case,  except  for  the  E{1')  and  E{1^)  terms. 
Clearly,  calculation  of  the  error  probabilities  is  unchanged  if  the  decision  variables  are 
normalized  to  N(o,  [p^i])  according  to 

^  n^lU,-E{n^l^d,  \H,) 

di  Vi  =  - 

cr 

Thus,  the  error  probability  is  Pg  {k  )=1-Pg  {k  ),  where 

F,(^)  =  Fr{4=maxd.-  \  H,}  =  Pr  {\it ,  <v,+P^/\l ,  f  ){l-pi,)  \  H^}  (7.9) 

t 

In  that  way,  we  get  the  same  well  known  expressions  for  the  error  probability,  for  any 
signal  set  but  the  signal  to  noise  energy  ratio  (i.e.,  the  factor  €/Nq  in  Chapters 

7-9  of  [65]),  is  replaced  by  P(/,  /  ), 

A  -S' A/') 

=  f)  ,  G(/,/)J-^  (7.10) 

iVQ  jC/y*  ) 


We  see  that  the  error  performance  in  the  multiple  hypothesis  detection  problem  for  the 
AWNGN  channel  is  again  characterized  by  the  functional  G(/,  /  ),  and  all  the  results 
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on  robust  estimation  of  a  location  parameter  by  an  M-estimator  apply  here  as  well. 
From  Eq.(7.9),  it  is  seen  that,  for  arbitrary  signal  set,  the  probability  of  correct  decision 
is  a  monotonic  increasing  function  of  P(/,  /  ).  To  show  this,  denote  and 

notice  that  if  X2>Xi,  the  event  A={\/  i,  <Vk+Xi{l-pif^)  \  Hji. ,  Vj^}  is  included  in 
B  =  {V  i ,  Vi  +X2[l-pik  )  I  }j  since  >0.  Therefore  Pr  {A  )<Pr  {B  }, 

and  from  (7.9)  Pf.{X  Pr  {A  }dF  [v]^)<Pc{X<^,  Consequently,  whenever  a  minimax 
relation  on  G(/,  /  )  exist  in  some  class  {/GL}  and  {/  GP},  it  is  translated  into  a 
minimax  relation  within  the  above  class  of  decision  rules  based  on  the  limiter-correlators 
(7.5). 

For  the  matched  filters  receiver,  l[x)=x  and  G{1  ^  f  )=1/Ef  {x")=l/ C  ^  i.e.,  the 
inverse  of  the  total  noise  power  density.  Thus,  if  one  only  measures  the  noise  power  at 
the  signal  bandwidth  and  does  not  have  any  knowledge  of  the  shape  of  the  noise  distri¬ 
bution,  the  error  probability  will  be  exactly  what  one  would  have  expected  from  the  clas¬ 
sical  theory;  hence,  one  would  not  have  any  reason  to  be  alarmed.  This  might  be  an 
explanation  why  communication  in  a  non-Gaussian  noise  has  not  received  any  attention 
so  far.  However,  it  is  known  that  for  all  distributions  of  equal  variance,  the  lowest 
G  [I  j  f  )  is  achieved  by  the  Gaussian,  and  the  pair  of  the  Gaussian  distribution  and  a 
linear  /(•)  constitute  a  saddle-point,  see  [3,  p.  83].  As  a  consequence  of  the  monotonicity 
of  the  error  probability  with  respect  to  the  effective  signal  to  noise  ratio  P,  the  perfor¬ 
mance  of  the  matched  filters  receiver  is  a  lower  bound;  any  other  receiver  which  utilizes 
some  knowledge  on  the  deviation  from  Gaussianity  (within  the  postulated  class  of 
AWNGN  processes)  will  perform  better.  In  particular,  if  we  assume  that  the  marginals 
belong  to  a  nominally  Gaussian  e-mixture  family,  /  (rr  )=(l-e)^  (x  )+e/i  (a: ),  the  decision 
rule  based  on  the  limiter-correlator  (7.5)  with  Huber’s  function  Iq{x  ;  -AT,  K)  of  Eq.  (4.9) 
is  the  minimax  solution 

^  When  k  =0,  correlates  the  signal  with  the  sign  of  the  observations.  This  scheme  was  con¬ 
sidered  by  Beaulieu  and  others  [72],  in  the  binary  case,  for  reasons  of  reducing  the  number  of  multi¬ 
plications. 
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Let  us  write 

^  ^  ^  /  )  ,  C-  =  (1-^)  + 

where  is  the  variance  of  the  contaminating  density.  Thus,  the  first  factor  in  P{lo,  f  ) 
is  the  true  signal  to  total  noise  energy  ratio,  while  the  second  factor  represents  a  gain 
factor  which  increases  the  effective  SNR  P  for  non-Gaussian  noise.  For  fixed  e,  G{Iq,  f  ) 
is  bounded,  but  C  can  be  arbitrary  large  when  the  contaminating  noise  is  heavy  tailed. 
Therefore,  the  performance  improvement  with  respect  to  the  matched  filters  receiver  and 
Gaussian  noise  of  equal  power(G)  can  be  arbitrary  large!  As  an  example,  when  h  is  also 
Gaussian  and  e=0.1,  gains  of  1.4,  8.8  and  17.9  dB  are  obtained  with  <7^=3,  10  and  30, 
respectively.  {G{Iq,  f  )  are  given  by  the  inverse  of  the  variances  in  Table  5.1). 

The  last  conclusion  can  also  be  interpreted  by  an  information-theoretic  viewpoint; 
with  the  postulated  AWNGN  channel,  essentially  error-free  communication  can  be 
achieved  at  transmission  rates  much  higher  than  the  channel  capacity  of  the  equivalent 
AWGN  channel.  Consider,  for  example,  an  orthogonal  signal  set  -  /9,-y=5,y.  If  the  noise 
density  /  is  known,  the  optimal  performance  for  the  assumed  class  of  decision  rules  is 
obtained  for  / (/  )—-/'//  ,  and  then  G{l{f  ),  /  )=/(/  ),  Fisher’s  information.  The 
data  transmission  rate  is  R=logoM/T  and  S=A‘^e  is  the  average  signal  power.  Paral¬ 
leling  the  derivation  of  [65,  Chap.  8.4],  we  obtain  that  the  limit  of  the  error  probability 
for  large  M  is 


0  if  -^nf)  >  1~2 


(7.11) 


Since  S' is  the  capacity  of  the  infinite  bandwidth  AWGN  channel,  and  since  for 
all  distributions  of  fixed  variance  /(/  )  is  minimized  by  the  Gaussian,  the  previous  claim 
is  clear.  ®  As  a  consequence  of  Shannon’s  coding  theorem,  this  derivation  shows,  in  fact, 

g 

Indeed,  theorem  7.4.3  of  Gallager  [66]  states  that  for  a  given  noise  variance,  and  for  constrained 
signal  energy,  Gaussian  noise  is  the  worst  additive  noise  from  a  capacity  viewpoint. 
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that  the  capacity  of  the  postulated  AWNGN  channel  is  lower  bounded  by 

/(/  )-5/(iVoln2)  ^ 

As  a  final  comment,  let  us  note  that  if  the  non-Gaussian  noise  is  indeed  a  QMGP, 
identical  results  can  be  shown  for  the  continuous  time  version  of  (7.5).  The  derivation 
parallels  that  of  Martin  [70],  based  on  theorems  of  Rosenblatt  [71]  on  the  asymptotic 
normality  of  some  integral  functionals  of  non-Gaussian  processes. 

7.3  Robust  M-ary  Coherent  receivers:  Implementation  with  generalized  M- 
estimators 

If  the  channel  is  severely  band  limited,  or  if  it  is  desired  to  reduce  the  number  of 
computations  in  a  digital  implementation  so  that  samples  are  taken  at  a  lower  rate,  n 
might  not  be  very  large,  and  the  input  SNR  needed  for  small  error  probabilities  is  bound 
to  be  relatively  high.  In  this  case  the  limiter-correlator  structure  might  cause  perfor¬ 
mance  losses.  Motivated  by  the  very  good  small  sample  performance  of  M-estimators, 
and  observing  that  dk ,  when  properly  normalized  by  the  energy,  is  an  unbiased  estima¬ 
tor  of  the  signal,  it  seems  appropriate  to  generate  the  decision  variables  as  generalized 
M-estimators 

4  =  arg  {T,Skil{xi-s,,idk)  =  0}  k  =  1, . M  (7.12) 

i  =  l 

Unfortunately,  even  the  asymptotic  performance  is  difficult  to  compute,  for  an  arbitrary 


^  The  direct  strategy  to  compute  capacity  for  an  arbitrary  additive  channel  was  derived  in  [66].  If 
X  is  the  channel’s  input  and  y  its  output,  the  constraint  on  the  input  energy  is  translated  into  a 
constraint  on  the  output  energy.  Then,  finding  the  capacity  is  reduced  to  maximizing  the  output 
entropy  with  respect  to  Pyiv),  for  the  given  constraint;  afterwards,  it  is  needed  to  find  a 

distribution  Px{^)  yielding  the  maximizing  Py(y).  Unfortunately,  the  maximizing  Pyiy)  is  found 
to  be  Gaussian,  therefore  this  approach  yields  results  only  for  an  additive  Gaussian  noise. 
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signal  set.  Indeed,  when  Hf.  is  true,  is  asymptotically  Gaussian  with 


lim  Ef  (4  \Hk)  =  a^e 

n  -»oo 


lim  Var  |  )  = 

n  -*>00 


•2BT 


This  can  be  shown  to  be  a  special  case  of  robust  M-estimation  of  a  one  dimensional 
regression  problem,  for  which  asymptotic  normality  has  been  proven  by  Huber,  see  [3], 
Moreover,  it  can  be  shown  that  each  of  the  ,  m  converge  in  probability  to 

^Pmk  ■  However,  it  is  difficult  to  show  that  the  vector  of  decision  variables  is  jointly 
normal  with  covariances  corresponding  to  Eq.  (7.7).  Presumably,  one  might  parallel  the 
derivation  following  Eq.  (D.ll)  of  App.  D.  Furthermore,  to  reduce  the  complexity,  a  “one 
-  step  ”  implementation  might  be  quite  efficient,  as  has  been  shown  in  Chap.  5  for  the 
simpler  binary  problem  with  a  constant  signal.  The  “one  -  step”  decision  variables  are 
the  first  iteration  of  Newton’s  method,  computed  according  to 


S  ^ki  ^  ^ki  ^k  ) 

dk  =  4“  +  -  (7.13) 

E  Ski  I  '{^i  -Ski  dk) 
i  =  l 

where  dj^  is  given  by  (7.5). 

For  several  particular  cases  which  are  widely  applied,  it  is  possible  to  exhibit  the 
asymptotic  performance,"  these  are  the  PAM,  PSK  and  QAiM  signaling  schemes. 


Robust  PAM 

Here  s,„j-  s,- ,  =27n  — 1— A/ .  Of  course,  the  energies  are  not  equal.  Now,  as  in 

the  classical  Gaussian  case,  only  a  single  decision  variable  is  needed 

n 

d  =  arg  {  2  Si  I  {xi -dsi )  ==  0  }  (7.14) 

t  =1 

and  the  decision  is  made  in  favor  of  one  of  the  M  possible  amplitude  levels  which 

is  nearest  to  d  (Of  course,  for  PAM  the  channel  attenuation  on  the  transmitted  signal 
must  be  precisely  known).  Clearly,  the  error  performance  is  closely  related  to  the 


-  187  - 


problem  of  estimating  the  one  dimensional  regression  parameter  a„A„  for  the  observa¬ 
tions  Xi=a„A^  Si+Tii .  As  n— »-oo,  n^^\d-a„A„)  is  zero  mean  Gaussian  with  variance 
Hence,  as  in  the  Gaussian  case  (cf.  [68]),  the  error  probability  is 
given  by  P,  =  2((M-l)/M)4>(-2P(/ ,  /  )i/2).  As  before,  monotonicity  with  respect  to 
the  generalized  SNR  leads  to  the  previous  conclusions  on  robustness. 

Robust  PSK  and  QAM 

For  PSK,  the  transmitted  signal  is 

j  27rfm  -1) 

=  =  e  (7.15) 

As  usual  for  PSK,  we  assume  that  the  noise  w{t)  is  a  narrowband  bandpass  process 
around  the  carrier  /j,  i.e.,  S^[f  )=CN q/2  for  |  /  ±/c  |  ^-8/2  and  zero  otherwise,  cf. 
[68].  We  also  assume  /<,  >B  l2»ll2T  so  that  w{t)  can  be  represented  by  its  in-phase 
and  quadrature  low-pass  processes.  Following  demodulation  by  the  in-phase  and  quadra¬ 
ture  mixers,  amplitude  normalization  and  time  sampling,  the  analog  to  Eq.  (7.3)  is 

■  ^ni  ~  Ji;,,-  ?/„,•  =  a„  Im  [Uj^)  -f  2=1, . n—BT  (7.16) 

With  the  previous  assumptions,  the  marginal  distributions  of  the  n,i  }  are  identical, 

and  all  the  r.v.’s  are  uncorrelated.  We  assume  the  analog  for  (p),  which  in  this  case 
requires  also  that  if  is  uncorrelated  with  ,  they  are  statistically  independent. 
Actually,  similar  to  the  derivation  of  App.  D,  it  suffices  to  require  that  the  distribution 
of  Tici  and  n^i  is  circularly  symmetric.  Now,  it  is  necessary  to  compute  two  M-estimators 
as  decision  variables 

=  arg  {  f)  / {xi-d^  )  =  0  }  dj,  =  arg  {  f)  /(y.-dj  =  0  }  (7.17) 

i  =1  i  =1 

From  them,  an  estimate  of  the  phase  is  computed  as  tan“^dy /d^ ,  and  a  decision  in 
favor  of  is  made  if  d„,  =27r(m  — 1)/M  is  nearest  to  0.  d^  and  dy  are  asymptotically 
jointly  Gaussian  and  independent,  with  common  variance  Ef  {l‘)/nEf{l').  Hence,  the 
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expression  for  the  error  probability  is  identical  to  the  Gaussian  case  where  the  SNR  is 
replaced  by  P{1,  f  ),  with  6=1.  Consequently,  the  previous  conclusions  on  robustness 
are  valid.  Moreover,  one-step  versions  of  (7.17)  have  identical  asymptotic  distribution; 
hence,  the  simplest  implementation  has  the  identical  robustness  property.  Furthermore, 
since  for  PSK  the  equivalent  lowpass  signal  is  constant,  identical  results  are  obtained 
(with  reduced  computation  load)  if  and  dy  are  replaced  by  a-trimmed  estimators,  as 
in  Chap.  5.2. 

It  is  well  known  that  the  bandwidth  efficiency  of  the  PAM  and  PSK  systems  is 
penalized  by  the  required  high  SNR,  and  that  a  combination  of  multiple  signal  ampli¬ 
tudes  and  phases  in  quadrature  amplitude  modulation  (QAFl)  reduces  the  required  SNR 
substantially.  Since  in  QAM  Sj„(t)=Re  +jBy„)exp[j  2it f  ^  t)],  a  receiver  which 
decides  in  favor  of  the  point  (A„  ,8^ )  which  has  the  shortest  Euclidean  distance  to 
(dj.  ,dy)  of  (7.17),  will  have  the  same  robustness  properties  as  the  robust  PSK  receiver. 

Orthogonal  and  equal-distance  signals 
Let  us  rewrite  Eq.  (7.3)  as 

M 

^ni  ==  E  ^m^mi  +  (7.18) 

m=l 

In  this  way,  we  can  cast  the  problem  as  a  regression  problem,  where  we  first  estimate  the 
vector  of  unknown  parameters  d' =(0i,....,0^)  and  then  we  decide  if  is  the  larg¬ 
est.  The  motivation  for  this  modification  is  that  we  can  utilize  some  results  from  the 
theory  of  robust  regression;  namely,  the  solution  of  the  system 

n  M 

E  Spi^o(a;.-  E  =  0  p=l, . ,M  (7.19) 

t  =  1  m  =  1 

is  the  minimax  optimal  solution,  see  [3].  On  the  negative  side,  in  general  the  coupled 
equations  might  require  more  computations  than  the  solution  of  the  M  one  dimensional 
problems  (7.12).  The  second  disadvantage  is  that  we  do  not  utilize  the  prior  knowledge 
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on  0  {i.e.,  one  of  the  components  is  1  and  the  other  are  0);  this  might  result  in  perfor¬ 
mance  losses.  It  turns  out  that  for  the  two  important  cases  of  orthogonal  and  uniform- 
distance  signal  sets  (  Pij=p  for  i^j ,  -1/{M -l)<p<\),  no  losses  are  incurred.  To  show 
this  let  us  define  the  5  matrix  whose  m column  is  the  signal  vector  on  the  m 

hypothesis  . )‘ ,  and  .  From  (7.4),  S=neR^,  where  Rj  is 

the  matrix  with  elements  p^j .  Huber’s  condition  for  joint  asymptotic  normality  of  the 
solutions  of  (7.19)  is  that  h  =maxiy,-,- — >  0  as  n  — >•  oo.  For  the  uniform-distance  set,  it 

i 

can  be  shown  that  Rf^  has  equal  diagonal  terms  {{M-2)p+l)/D  , 
D  =l+p{M-2)-p~{M-l)>0,  and  equal  olf-diagonal  terms  -p/D  .  Thus,  it  can  be  shown 
that,  with  G  =max5j,- , 

h  <  max{  ^  Ml-WG’  } 

n  6D  n  €.D 

This  is  also  valid  for  the  orthogonal  set  where  p=0.  Hence,  if  M  is  not  too  large,  the 
condition  is  satisfied,  the  solutions  of  (7.19)  converge  in  probability  to  the  true  values, 
and  has  the  covariance  matrix 


Ri  =  {S^S) 


EAn 


=  R. 


-1  1 


Ef  {A 
e  EAi') 


(7.20) 


From  (7.6)  to  (7.8),  it  is  seen  that  asymptotically,  R^O  converges  in  probability  to  d, 
where  d  is  the  vector  of  the  limiter-correlators  (7.5).  Since  the  following  equality  con¬ 
verges  in  probability. 


di-dj  — {l-pij){6i-9j)  +  ^  9j{pii-pji) 

we  have  for  both  the  uniform-distance  and  orthogonal  sets, 

(7.21) 

■Pc(d  i  Ffi)  =  Pr  {max{df.-di)>0  \  H^}  =  Pr  {max(l-p)(6»^. -0,- )>0  \  Hi,}  =  P^  [6  \  H/,) 

I  i 


A  simple  counter-example  shows  that  Pij=p  is  also  necessary  for  (7.21).  Note  that  the 
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above  derivation  is  not  valid  for  the  simplex  set,  for  which  p=-l/(M-l)  and  is 
singular. 

In  conclusion,  for  most  cases  of  interest,  it  has  been  shown  that  generating  the  deci¬ 
sion  variables  from  one  form  or  another  of  M-estimation,  leads  to  identical  asymptotic 
error  probability  and  robustness  as  of  the  limiter-  correlator  implementation.  We  conjec¬ 
ture  superiority  of  the  first  for  small  sample  sizes  and  small  desired  error  probabilities) 
this,  however  must  be  confirmed  by  a  simulation  study,  preferably  with  real  non- 
Gaussian  data. 

7.4  Robust  M-ary  noncoherent  communication 

Similar  results  can  be  obtained  for  noncoherent  communication,  where  no  attempt 
is  made  to  track  the  phase  shift  introduced  by  the  channel  0,  The  only  difference  is  that 
we  have  to  consider  the  in-phase  and  quadrature  components  of  the  received  signal.  We 
demonstrate  the  structure  for  limiter-  correlator  implementation,  but  some  of  the  deriva¬ 
tions  of  Section  7.3  can  also  be  extended. 

Let  the  transmitted  signal  be  s^{t)=Re[C^[t)Qx^[j  27:  f  ^  t)\,  C^[t)= 

)•  The  signal  energy  is  given  here  by 
T  T 

^s^[t)dt  =  \C^{t)\‘^dt  =  eT  (7.22) 

where  we  have  assumed,  as  usual,  f  ^  so  that  double  frequency  terms  could  be 

omitted.  The  following  inner-products  are  defined 
T  T 

ReJC^(t)C,V)dt  =flA^(f}Ai(t)+B^((}Bi(t)jdt  =2£Tp^i  (7.23) 

0  0 

T  T 

ImJC„(t}C!*(()dt  =  flA^(t)Bi(t)-B„(t)Ai(t)]dt  =2eTp~^i  (7.24) 
0  0 

where  the  same  approximation  leading  to  (7.22)  has  been  used.  The  received  signal  under 
the  m  hypothesis  is 
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r{t)  =  Re[{AC^{t)e^^  +  n,{t)- 0<  t  <T  (7.25) 

where  nj(i)  and  ng{t)  are  the  in-phase  and  quadrature  components  of  the  narrowband 
bandpass  noise  process,  with  spectra  S„^{f  )=CNq/2  for  |  /  |  <B /2  and  zero 

otherwise.  For  validity  of  the  noise  representation  and  for  undistorted  reception  of  the 
signal,  the  standard  assumption  f  ^  >B /2»l/r  is  made. 

For  the  AWGN  channel,  the  optimal  receiver  decides  in  favor  of  the  largest  , 
where  here  is  the  output  of  a  quadrature  matched  filter,  cf.  [65],  The  following 
representation  of  ,  which  can  be  shown  to  be  identical  to  the  more  familiar  expres¬ 
sions  of  [65],  is  more  convenient  for  robust  modification. 

T 

4  =  +  jV^  =  /  C^{t)[X{t)+jY{t)]dt  (7.26) 

0 

where 

X(0  =  [2r(i)cos(27r/<,  OliPF  Y{t  )  =  {2r  {t  )sm{27vf  ^  t)]LPF  (7.27) 

and  the  symbol  [•jip/'  denotes  the  operation  of  passing  the  argument  through  an  ideal 
lowpass  filter  with  unity  gain  and  cutoff  frequency  B  /2.  Proceeding  as  in  Section  7.2, 
the  normalized  in-phase  and  quadrature  time  samples  on  the  m  hypothesis  are  given 

by 

(7.28) 

■  ^ni  ^  ]  “b  >  Vni^^n  ^  t  =l,..n  =BT 

The  assumptions  on  {n^i ,  }  are  identical  with  those  made  following  Eq.  (7.16).  As  a 

robustification  of  the  quadrature  matched  filter  for  the  AWNGN  channel,  we  introduce 
as  before  non-linear  limiter-correlators.  Namely,  we  still  take  d„  =  U^+V^  to  get  rid  of 
the  unknown  phase,  but  now  (7.26)  is  replaced  by 

~  ~Yj  l^mi  ^  {^ni  ^  [Vni  )]  =  Yj  {~^mi  ^  i^ni  )+^mi  ^  {Vni  )] 

'‘»=1  ^i=l 


(7.29) 
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Following  similar  arguments  as  before,  F„,  }  are  asymptotically  jointly  Gaussian. 

After  a  tedious  but  straightforward  Taylor  expansion,  the  following  moments  are 
obtained  : 

lim  E  (n  1/2  \Hk)  =  2A  [2jL)meEf  cos9  +  sin^]  (7.30) 

n  ->00  iV  Q 

lim  E{n^l‘^V^  |  FT*)  ==  2A  {^fl'^eEj  (/')[pm4sin0  -  P~mk<^osd\  (7.31) 

n->oo  ^0 

(7.32) 

lim  Cov[n^l^U^,  \  Hi,)  =  lim  Cov{n^l^V^,  \H,,)  =  2eEf  {l‘^)pmi 

n  -+00  n  —♦00 

lim  Cov{n^/-U„, ,  \  Hk)  =  2eEf  (/ VmJ  (7.33) 

n  -+00 

The  variances  are  obtained  by  substituting  For  Eqs.  (7.32  -33),  it  suffices  to 

assume  that  {71^,71^}  have  a  circularly  symmetric  distribution,  or  that  they  are 
independent.  Comparing  Eqs.  (7.30  -  33)  with  the  corresponding  terms  for  the  linear 
matched  filters  and  Gaussian  noise,  cf.  [65,  Chaps.  7.2  and  8.10],  we  observe  that  they 
are  identical,  provided  we  multiply  and  by  (n  /{NoEf  (Z^)))^"^^,  and  we  modify 
the  SNR  to  P(/,  /  )  according  to  (7.10).  Hence,  we  arrive  at  the  previous  conclusions 
on  robustness,  provided  the  error  probability  of  the  optimal  noncoherent  receiver  for  the 
AWGN  channel  is  a  monotonic  decreasing  function  of  the  SNR.  While  this  expected 
behavior  is  apparent  from  all  the  performance  curves  given  in  the  literature,  it  is  not  a 
simple  matter  to  prove  it  from  the  various  complicated  error  functions  for  noncoherent 
M-ary  demodulation.  (Unfortunately,  the  ordering  of  decision  variables  in  the  coherent 
case,  that  is  evident  in  (7.9),  is  not  preserved  here.  This  is  due  to  the  presence  of  the 
unknown  phase.  Consequently,  a  general  proof  of  monotonicity  has  not  been  found).  An 
exception  is  the  binary  case  with  orthogonal  signals,  Pmi=Pmi—^  (they  are  the  optimal 
for  noncoherent  binary  communication)  for  which  the  error  probability  is  given  by 
Pg  =(l/2)exp(-6/(27Vo)).  The  generalization  for  M-ary  orthogonal  signals,  which  are 
believed  to  be  the  optimal,  is  given  in  App.  J. 
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Appendix  A 


Small  Sample  Performance  of  the  Sign  Detector 

The  sign  detector  (SD)  is  the  simplest  and  probably  the  most  popular  non- 
parametric  detector  for  a  deterministic  lowpass  signal  in  noise  with  uncertain  symmetri¬ 
cal  density.  Under  several  assumptions,  it  is  known  to  be  the  UMP  detector  [32]  or  the 
minimax  LORD  detector  [5]  for  those  particular  situations.  Under  the  asymptotic  weak- 
signal/large-sample  [n q)  assumptions,  the  ARE  with  respect  to  the  linear 
detector  (LD)  at  noise  density  /  (•)  is  ARE^d  (0))^.  This  is  0.637  for  Gaussian 

noise,  2  for  Laplace  (double  exponential)  noise  and  in  between  for  many  other  densities. 
Hence,  the  SD  is  generally  considered  to  be  very  efficient  for  many  situations  of  uncer¬ 
tainty. 

Small  sample  analysis  reveals  that  it  may  not  perform  as  well,  as  the  fine  details  of 
the  actual  small  sample-size  error  probabilities  are  lost  in  the  asymptotic  analysis.  The 


sign  test  is  given  by: 

n 

'>t  , 

=  t  , 

Hi  with  probability  c 

(A.l) 

t  =1 

<t  , 

Ho 

where  u  (•)  is  the  unity  step  function.  Notice  that  a  randomized  test  is  required  since 
r(x)  is  a  discrete  binomially  distributed  r.v.  Assuming  symmetry  of  /  (•),  the  false 
alarm  probability  is  given  by: 


n 

(  n 

=  (-)"  ] 

.  k  J 

+  1 

c 

t  <n-l 

,  i=„  (A-2) 

When  n  is  small  and  the  desired  is  very  small,  the  second  row  of  (A.2)  must  be 

chosen,  and  therefore  the  detection  probability  is  also  proportional  to  the  randomization 
constant  c  which  might  be  very  small.  Denoting  Probfj^{xi  >0}  =  p  ,  it  is  given  by 


P,  =  cp"  ==ao2>"  < 


ao2" 


5iVfi-oo 


(A.3) 
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The  critical  values  of  the  sample  size  such  that  for  n  <n^  the  detector  becomes 
very  poor  {P^  <0.5,  even  for  SNR  -*oo)  are  found  to  be  12,  19  and  26  for 
Oq  —  10“^,  10  ®,  10*^,  respectively;  these  numbers  are  typical  of  many  practical  systems. 
A  similar  situation  is  encountered  with  the  Wilcoxon  detector,  as  its  minimal  false  alarm 
probability  (without  randomization)  is  also  2“” . 


-  195  - 


Appendix  B 

An  Approximation  for  the  Distribution  of  the 
Envelope  Detector 

In  this  appendix  an  approximation  is  derived  for  the  distribution  of  the  coherent 

n  n 

envelope  statistic:  R  =  /  =  (1/n )  ^  x,-  ,  Q  =  [l/n]  ,  where  x  and  y 

1  1 

are  i.i.d.  vectors  which  are  also  independent  of  each  other.  When  the  samples  are  Gaus¬ 
sian,  R  is  one  sided  exponentially  distributed.  Thus,  we  are  interested  in  an  approxima¬ 
tion  which  exhibits  this  exponential  limiting  behavior. 

Formally,  any  univariate  density  /  (a:)  can  be  expanded  in  a  series  of  orthonormal 
functions  {4>jt(^)}  ^ 

CX) 

/(x)=  E  C,  ^,{x)  (B.l) 

which  is  1^2  convergent  when  all  moments  of  /  exist.  A  classical  expansion  in  terms  of 
the  Gaussian  density  and  its  derivatives  (which  are  proportional  to  Hermite  polynomials) 
is  the  type  A  Gram-Charlier  series  [36],  or  its  reordered  version,  the  Edgeworth’s  form 
which,  when  applied  to  the  distribution  of  the  samples  mean,  gives  the  best  order  in 
powers  of  Conditions  for  convergence  of  the  infinite  Edgeworth  series  are  rather 

technical  and  implicit  (they  require  conditions  on  /  which  is  usually  unknown);  more¬ 
over,  they  are  of  little  value  from  the  statistical  viewpoint  where  the  important  question 
is  whether  a  small  number  of  terms  can  furnish  a  reasonable  approximation.  In  particu¬ 
lar,  while  the  Edgeworth  expansion  of  the  sample  mean  converges  to  the  Gaussian  den¬ 
sity  with  finite  terms  as  n  — +  oo,  when  the  expansion  is  carried  out  for  finite  n  ,  adding 
more  terms  in  the  series  above  some  optimal  number  usually  results  in  a  worse  fit.  See 
[36,  Chap.  6]  and  [37,  Chap.  16]. 

As  pointed  out  in  a  footnote  [36,  Section  6.24],  when  an  approximation  for  the  dis- 
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tribution  of  a  positive  random  variable  is  desired,  it  is  plausible  to  use  a  family  of  func¬ 
tions  which  are  supported  only  on  [0,  oo).  In  this  way,  the  error  from  the  tails  of  the 
Gaussian  density  and  its  derivatives  at  a;  <  0  would  be  eliminated.  Since  in  our  case 
the  limiting  one  sided  exponential  distribution  belongs  to  the  Gamma  family,  the 
appropriate  polynomials  are  those  of  Laguerre  defined  by  [38] 


I  ! 


dx 


T-(e 

I  ^ 


-I  „  t  +a 


(B.2) 


The  orthonormality  relation  is 


fe-x-  L,%x )  L/(x )  -  K-  +  ]  +  ')  (B.3) 


hence  the  expansion  is 


oo 

/(^)=  E  C'*  e-^x^L,^(x) 

1=0 

where 


(B.4) 


OO 

=  r(a  +'i  +  1)  (  )-'■(*)<'''  (B'5) 

The  first  four  polynomials  are 

J^o(x)  =  l  (B.6) 

Li(x}  =  1  +  a  -  X  (-3  7) 

^2(2;)  =  j  [(a  +  1)  (a  +  2)  -  2ar(a  +  2)  +  2:“]  (B.8) 

Ll[x)  =  ^  [(a+1)  (a +2)  (o+3)  -  3a;  (a +2)  (a +3)  +  3x2(a+3)  -  a;^]  (B.9) 

Like  the  Edgeworth  expansion,  it  is  possible  to  eliminate  the  terms  f  =  1,  2  by  a 


transformation.  Defining  x=by ,  the  following  expressions  are  obtained  from  (B.5)  and 
(B.6)-(B.9): 


Go  = 
6-1  = 


1 

6r(a  +  1) 

1 

6  r(a  +2) 


+  1)  -  -^1 


(B.IO) 


(B.ll) 
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=  Tnhfi  +  2)  (“  + ')  -  +  2)  T  +  ill  (B.ia) 

{B-13) 

'6  r(a+4)  +l)-3(a  +3)(a  +2)-^  +3(a  +3)-p-  -  -^] 

where  /z,-  =  (a:‘ ).  Solving  for  =  ^2  ==  0,  we  obtain: 


^  _  /^2  -  Ml  ^  J_ 

Ml  ""  /5 

2/('..  ..  2i  1  A 


a  =  Mi/(M2-Mi  )  -  1 

Hence,  upon  substituting  back  with  y  ==  /3x 


a 


(B.14) 

(B.15) 


/(^)  = 


/3“e 


-0X  X 


(B.16) 


r(a) 


+ 


/5' 


r(Q;  +  3) 


(a  +  2)/i2  -  Pys 


/9“e 


-0X 


{/3x) 


The  leading  term  is  recognized  as  the  Gamma  density  which,  as  tends  to  the 

desired  exponential  density. 

Integrating  (B.16)  to  get  the  false  alarm  probability  P^{x  >  t  /ij  =  P^{y  >  t  a} 
is  simplified  by  using  the  defining  expression  for  the  Laguerre  polynomials  (B.2).  Some 
tedious  algebra  yields: 


Pfa  =  Pr{^  >  i  Ml}  ={1-  I{a,  t  a)]  + 

c  ^ 

d - Ml,  M3)e~“‘  {at )“  [2at  (a  +  2)  -  {at  )2  _  («  +  i)  (q,  +  2)]  +  •  •  •  (B.17) 

where 


and 


C. 


a 

f{a  +  3) 


(«+l)(a+2)-^] 

J 


(B.18) 


is  the  incomplete  Gamma  function.  It  is  clear  from  (B.17)  that  the  second  term  vanishes 
for  t  =0  as  well  as  for  t  =co,  thus  fulfilling  some  of  the  properties  of  a  proper  distribu¬ 
tion  (since  the  first  term  is  a  proper  distribution).  However,  it  cannot  be  said  whether  or 
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t 


not  (B.16)  can  assume  negative  values,  a  common  feature  of  all  finite  Gram-Charlier 
expansions.  The  following  term  in  the  expansion  turns  to  be  so  complicated  so  it  will 
not  be  given  here. 

To  get  specific  results  for  the  coherent  envelope  statistics  R  defined  in  the  begin¬ 
ning  of  the  appendix,  we  need  to  express  the  moments  of  R  in  terms  of  those  of  the  i.i.d. 

samples,  E{R"')=  ;;p  •E' )^  +  (Zll/j  )^]^  •  Using  the  moment  generating  func- 

n  1  1 

tion,  tedious  but  straightforward  manipulations  yield 


,x,  =  E{R)  =  la? 

n 

ti,  =  E{R^)  =  ^[4  +  -^] 

n  ai 

=  E{R^)=^[24  +  {L-15K)] 


lA 


m  I 


err 


-15 


(B.20) 

(B.21) 

(B.22) 


where  =  E mj  =E{xA  and  K  is  the  coefficient  of  kurtosis.  When  x,-  are 
Gaussian,  both  K  and  L  obviously  vanish.  Substituting  these  into  (B.15)  and  (B.18)  we 
obtain: 


or  =  (1  +  A" /2n  ) 


,-i 


(B,23) 


_  ^  fo^.  I  o  I  9Ar  ,  L  —  15A  M 


CJ  = 


(B.24) 


Hence,  the  limiting  behavior  of  the  approximation  with  only  the  first  four  terms  of 
(B.4)  occurs  when  the  number  of  samples  n  is  much  greater  than  the  kurtosis 
{^K ! n  ►  0).  Then  ot  >•  1  and  the  first  term  in  (B.17)  approaches  as  required  by  the 
central  limit  theorem,  though  (unlike  the  Edgeworth  expansion)  the  first  term  itself  actu¬ 
ally  includes  all  powers  of  K /n  in  a  highly  nonlinear  fashion.  When  a  <  1,  a  com¬ 
parison  with  tables  of  the  incomplete  Gamma  function  shows  that  for  sufficiently  large  t 
(such  that  the  nominal  false  alarm  probability  Pf^  =  e~‘  is  small),  the  outcome  of  Eq. 
(B.17)  could  be  orders  of  magnitude  higher  than  the  Pf^  as  expected.  In  the  second 
term  of  (B.17),  all  the  expressions  tend  to  a  constant  when  a— >-1  except  G3'  which 
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vanishes  identically  when  a,  =  1.  For  other  values  of  a,  expansion  in  K /n  gives 


r  /  K  W  .  ^  f  \ 

hm  C3'  - - TW  +  ^  (—0-) 


n 


n  12 


(B.25) 


For  the  normal-normal  mixture:  /  (a:,-)  =  (1  -  e)  N{0,  1)  +  eA^(0,  C^), 


K  =  3e(l  -  e) 


C^-1 


12 


1-e  +  eC^ 


tC^  »  1  e 


L  -=  15 


(1  -  £  +  eCy 


tC^ »  1  e 


2 


(B.26) 

(B.27) 


Therefore,  with  small  amounts  of  large  contamination  and  n  not  very  large,  a  can  be 
much  smaller  than  one  and  the  contribution  of  (B,24)  is  also  not  negligible. 

Actual  computation  of  (B.17),  using  Equations  (B.20)-(B.24),  (B.26),  (B.27),  is 
shown  in  figures  2. 1-2.5  of  Section  2.2  and  compared  to  Monte-Carlo  simulation  results. 
For  a  roughly  in  the  range  [0.5,1],  this  approximation  is  seen  to  be  quite  good.  It  is  still 
a  reasonable  prediction  of  the  amount  of  increase  in  the  false  alarm  probability  down  to 
O'  =  0.3. 
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APPENDIX  C 


The  Maximin  Test  for  a  Single  Observation 


For  a  single  observation  of  a  r.v.  R  which  comes  from  simple  mixture  hypotheses 
Pq  and  Pi  as  in  Eq.  (1.1),  Huber’s  [2]  maximin  robust  test  takes  the  form 

'  ,  1{R]  L'  ,  L"  )>  t 


d*  (R)  =  <  Hi  with  probability  c 


,  1{R;L'  ,L"  )  =  t  (C.l) 

,  1{R;  L'  ,  L"  )  <  t 


where  the  soft  limited  1{R;L',L")  is  given  by  Eqs.  (1.6-1.8).  The  constants 


0  L'  ^  L'' <^oo  are  found  by  solving 


Pi{L  >L'  }  +L'  Po{L  <L'  }  =  (1  -  e)-i 
Po{L  <L"  }  +  {1/L  "  )Pi{L  >L"  }  =  (1  -  e)-i 


(C.2) 

(C.3) 


where  L{R)  =  f  i[R  )/ /  g(i?  )  is  the  LR  under  the  nominal  situation,  and  is  assumed  to 
be  continuous.  Application  of  these  equations  to  the  distribution  of  the  coherent 
envelope  of  a  Rayleigh  narrowband  signal  in  narrowband  Gaussian  noise,  Eqs.  (3. 3-3. 4), 
results  in  Eqs.  (3. 6-3. 7). 

Since  l[R  ;  L'  ,  L"  )  is  a  monotone  increasing  function  of  L  {R  ),  which  in  turn  is 
monotonic  in  R ,  Eq.  (C.l)  can  be  reformulated  as  a  randomized  test  on  R  .  The  thres¬ 
hold  and  the  randomization  constant  are  functions  of  the  least  favorable  p.d.f.’s.  Three 
different  cases  are  possible,  according  to  t  =  L'  ,  L'  <  t  <  L"  and  t  =  L"  ,  see 


Figure  1.  Define  R'  and  E "  hy  L  [R>  )  =  L '  ,  L  [R"  )  L"  .  Define  also 

OO  00 

a{x )  =  f  /  o(r  )dr  ,  P[x )  —  J  f  i{r)dr  .  Then, 


Case  A 


t=L'  =>  e  <{ao-a{R '))/{!- a{R'  ))  (C.4) 


Hi  with  probability  c  , 


R  >  R' 
R  <  R' 


(C.5) 


where  0<c  <1  is  given,  as  a  function  of  the  desired  oq  and  of  R  ’  which  solves  (C.2),  by 
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ao  -  e  -  (1  -  €)a{R  '  ) 


(l-e)(l-c.(i?'  )) 

The  detection  probability  under  the  least-favorable  is 

/3{d\q*)  =  c  +{l-e){l-c)  )  =  l-{l-ao)L‘ 


Case  R 


L  <t  <L"  =>  g  ~  ,)  <  ,  < 


\-a{R'  ) 


1  -  a{R  "  ) 


where 


Here, 


\  - 


,  R  >Rt 
R  <  Rt 


Q?o  —  £  +  (1  -  e)  a[Rt ) 


^(^*,<7  0  =  (l-e) /?(/?,) 


Case  C 


L"  =t=>e>{ao-  aiR  "  ))/(!  -  a{R  "  )) 


where 


d* 


(R)  =  { 


Hi  with  probability 

Ho 


R  >  R" 
R  <  R” 


(C.6) 

(C.7) 

(C.8) 

(C.9) 

(C.IO) 

(C.ll) 

(C.12) 

(C.13) 


Here, 


c 


_ «o _ 

e  +  (1  -  e)  a{R  ”  ) 


(C.14) 


I3{d*,qt)  = 


ao{l-e)/3{R"  ) 
e  +  (1  -  e)  a{R  "  ) 


O'qL 


II 


(C.15) 


Notice  that  in  this  case  where  large  e  necessitates  randomized  maximin  test,  the  result¬ 
ing  maximin  lower  bound  on  /9,  Eq.  (C.15)  is  very  small,  as  it  is  proportional  to  the 
desired  ag- 

The  proof  of  Eqs.  (C.4-C.15)  follows  from  (C.1-C.3).  For  Case  A,  (C.l)  is 
translated  into  (C.5)  as  a  result  of  the  above  mentioned  monotonicity  properties.  To 
satisfy  the  constraint  on  the  probability  of  false  alarm,  we  must  have 
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ao  =  P^{R  >R<  }  +  c  P^{R  <R'  } 

=  Po{R'  <R  <R"  )+Pa{R  >R"  }  +  cP^{R  <  R'  } 

Utilizing  the  expression  for  the  least  favorable  density  Qq  ,  Eq.  (1.6), 

ao  =  (l-e)Fo{^'  <R  <R"  }  +  c{l-e)Po{R  <R'  }  +  P  ,{R  >R"  } 

Lt 

The  last  term  is  equal  to  1  -  (1  -  e)Po{R  <  R"  },  by  virtue  of  Eq.  (C.3).  Collect¬ 
ing  terms,  we  obtain 

ao(l  -  e)(l  -  a{R '  ))(c  -  1)  -f  1 

from  which  (C.6)  follows.  In  order  to  satisfy  c  >  0,  we  get  the  appropriate  e  range,  Eq. 
(C.4).  In  a  similar  manner,  from  Eq.  (1.2)  for  , 

=  P{{R  >R')  +  cP[{R  <  R'  } 

=  [\-  e)P^{R>R'  } -t- (1 -e)  L'cPo{^  <  R'  ] 

and  the  last  term,  by  virtue  of  Eq.  (C.2),  is. equal  to  c  [l  -  (1  -  e)P ^{R  >  R'  }].  Col¬ 
lecting  terms,  the  first  equality  of  Eq.  (C.7)  is  obtained,  and  the  second  one  is  found  by 
substitution  of  (C.6)  and  using  (C.2)  again. 

Cases  B  and  C  are  proved  in  a  similar  manner. 


Figure  1 
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Appendix  D 

Statistics  of  the  SSQME  Test  with  Unmatched  Frequency 

We  first  generalize  the  asymptotic  normality  of  M-estimators  for  a  sequence  of  r.v.’s 
which  are  not  identically  distributed. 

Lemma 

Let  /G^  of  Section  4.1  and  assume  also  that  /  is  bounded.  Let  {x,- }  be  a  sequence  of 
independent  r.v.’s  with  p.d.f.  where  /  is  symmetric  and  absolutely 


continuous,  and  lim  nA^=a'^<oo.  Then  where  A„=arg  {  f]  l{xi-A„)=0} 

n  -*-00  1 

i  =1 


IS 


asymptotically  distributed  as  N{m  ,V)  where 


m 


(D.l) 


and 


V  = 


-  ,  EAl^)  ^ 

-EEiil')? 


(D.2) 


Proof  The  proof  follows  Huber’s  [1]  with  some  modifications  to  account  for  the 
unequal  means  and  variances,  hence  it  will  only  be  briefly  sketched.  By  monotonicity  of 

a-), 


lim  Prob  {A„  -An  <  ^  }  =  lim  Prob  ~  ^0} 


(D.3) 


Define  y„i=l[xi-kn  Upon  expanding  in  a  Taylor  series  of  powers  of  n  and 
invoking  the  assumed  symmetry  properties  of  /  and  /,  some  terms  vanish  and  we 
obtain: 


m„iAE{Y„i)  =  n  -  k)Ei{l')  +  O  {n~^) 


(D.4) 
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(n  (D-5) 

There  is  a  slight  complication  since  the  t/„,-  are  different  for  different  values  of  n ,  and 
the  usual  formulations  of  normal  convergence  do  not  apply.  However,  this  case  is  covered 
by  the  Lindeberg  -  Feller  theorem  [41],  which  asserts  that  a  necessary  condition  for  con- 

vergence  to  a  standard  normal  distribution  of  5„  =  ^  ^yith  cr„-=  ^  v„i ,  is 

*=1  2=1 

that  each  summand  contributes  negligibly  to  s„  in  the  sense  of  Lindeberg’s  condition. 
Explicitly, 


lim  max 

M  — ♦  00  1  <  i  < 


^  cr; 


(D.6) 


which  is  satisfied  since  /(•)  is  bounded,  u„,-  ^0  and  grows  as  n  ;  The  sufficient  condi¬ 
tion  is  proved  in  the  same  manner  as  in  Huber[l,3].  Interchanging  of  lim  and  Frod  {■} 

n  —^co 

is  allowed  since  }  converges.  Subtracting  \im  /(rn={m-k)/vy-  from  both 

n-^00  ‘ 

sides  of  the  argument  of  the  second  expression  in  (D.3)  yields 


lim  Prob  {A„\/n'  <k} 


^1/2 


(D.7) 


We  now  turn  to  prove  Proposition  5.3.  Let 


'  k ' 

5, 

Qi  _ 

Vi 

(D.8) 

where  c,- =cos0,' ,  s,-=sin0,-,  Q^=^‘l'Kik  j  n  .  The  input  observations,  with  a  sinusoid  signal 
of  Doppler  frequency  /  ,  are  given  by 


^  ^  cos(^+<;i,-)  ,  2/,-  =  sin(</i+<^,- )  (D.9) 

where  (  A  measures  the  frequency  deviation  from  the  nearest 

filter,  normalized  by  the  filter  bandwidth  /,.  jn  ).  Let  /  and  Q  be  the  M  -  estimators  of 
the  {/,■}  and  {g,-},  respectively,  where  the  /(•)  function  is  assumed 
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monotonic  ,  bounded  and  skew  —  symmetric  .  As  a  consequence  of  the  symmetry  of  /(•) 
and  the  transformation  (D.8),  the  following  expectations  under  Hq  (a  =0)  satisfy: 

=  Eo(l(Q,))  =  Fo(l'U)}  =  ES"{Qi))  =  0 

E^{V)  J  E^{V{k))  =  Eoil'iQi))  ^  0  (identical  Vf) 

Eo(l^)  ^  Eo{l\li))  =  E,[l\Qi))  ^  0  (identical  \ii) 

Eo{Kli)l{Qi))  =  Eo{l{Ii)l'{Qi))  =  E,{lVi)l[Qi))  =  0 

Eo{EjI'q)  a  EQ[V[Ii)l\Qi))  ^  0  (identical  V  i) 

Notice  that  /,•  and  Q,-  are  in  general  not  independent  under  Hq  (  unless  X;  and  Y,-  are 
Gaussian),  but  they  are  uncorrelated,  and  their  joint  density  is  circularly  symmetric  - 
/o(A'>  Qi)=9  Since  the  following  derivation  depends  only  on  the  above  pro¬ 

perties  of  the  various  expectations,  which  are  valid  with  any  bivariate  circularly  sym¬ 
metric  p.d.f.,  the  original  assumption  that  and  are  i.i.d.  can  be  replaced  by 
assuming  they  are  circularly  symmetric.  In  particular,  this  is  true  when  the  narrow  - 
band  noise  process  is  realized  in  the  “natural”  way  as  .  =X,- cos7,-  ,  Xj  =A/',- sinq,- , 

where  7,-  is  C/[0,  27r]  and  is  independent  of  A/,-,  which  can  be  arbitrary  positive  r.v. 

Applying  the  lemma,  /  and  Q  are  marginally  Gaussian.  Since  o(/,- )=  Yaro(Q,- ), 

(D.2)  yields 

lim  Var^[sf^I)  =  lim  Vary[\f^Q)  =  E q{1") / E q-[1 ')  A  ^1  (D.IO) 

n  —►00  n  — »-oo 

From  (D.l), 


lim  £;i(\/^/)  =  lim  ±Y^cos[<i>+<t>i)  A  I 

n  —►00  n  ~*oo  n 


(Dll) 

lim  Ei{\fnQ)  =  lim  —Y]s'm{4>+(pi)  ~  Q 

n  -►oo  n  -*oo  71 


We  want  to  show  that  the  joint  distribution  of  I  and  Q  ,  averaged  with  respect  to 
is  in  the  form  of  Eq.(4.15).  For  that,  it  is  needed  to  show  that  /  and  Q  are  asymptot¬ 
ically  independent.  Invoking  the  monotonicity  assumption. 


(D.12) 
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P  {xAT/  <c,V7iQ  <d}  =  P{U^  A  1  <  0^  A  1  <  0| 

V  n  Vn 

where 

^ni  =  l{Ii--j=)  =  /(iVc/  +'^^)  >  Pi  =  a  cos(^+(?i,--6',-)-c  (D.13) 

=  l{Ii—^)  =  I  {Ns/  +-^)  ,  qi  =  asm{(f)+4>i~di)-d  (D.14) 

and  the  primed  r.v.’s  are  the  result  of  the  transformation  (D.8)  applied  to  the  input 
noise  samples. 

We  have  to  show  that  the  r.v.  VK„  C/„ +/? ^  is  asymptotically  Gaus- 

i=l 

Sian  for  any  o;  and  /?.  Let  =(  ))/(  E  Far  ( IK^,- ))'/'  and  consider  only 

i=l 

bounded  /(•)  functions,  which  are  sufficient  for  our  study.  (Actually,  monotonicity  is 
sufficient  for  the  following,  but  the  proof  is  longer).  Obviously,  |  |  <M„,-  <co  and 

-^ni  =0  due  to  the  above  normalization.  Thus,  Lindeberg’s  condition  is  satisfied 

(see  [64,  pp.  201]),  U„  and  F„  (hence  also  I  and  Q  )  are  asymptotically  jointly  Gaussian, 
and  it  suffices  to  show  they  are  asymptotically  uncorrelated  for  independence  (Note  that 
thus  far  no  assumption  on  the  joint  distribution  was  necessary).  Expanding  in  a  Taylor 
series,  noting  that  some  of  the  various  expectations  under  Hq  vanish  as  above,  we  find 

^ ^  ^ ^ 

i( F„,- )  =  +  0{ — ^)  (D.16) 

Var  i(  )  =  Var  i(  F„  )  =  Eo{l^)  +  O  (i-) 

n 

Hence, 

^  i(  F„  )  -  ^ ,( t/„  )E  i(  F„  )  =  1  E  [E  i(  F„,  )-E,{  (7„,  )E  /  F„, )] 

1=1 

=  A  S +  0(-U)  =  0(±) 

(  ^  t=i  j 
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Therefore,  the  asymptotic  joint  distribution  of  U„  and  is  that  of  a  pair  of  indepen¬ 
dent  Gaussian  variables.  Returning  to  (D.12)  and  using  (D.IO  -  D.ll), 

lim  P  {\Ar/  <  c  ,  <  d}=  lim  P  <  0,  <0} 

n— *00  n  — ►oo 

=  \im^p  I  <  c}P  {vV  g  <  d }  = 

Transforming  \/n  I ,  \fn  Q  ,  the  argument  of  the  exponent  of  the  joint 
Gaussian  density  is  thus 


-1 

2V, 


{  P+  g2+  r+  Q‘^-  2(7/  +  gg ) } 


The  middle  pair  does  not  depend  on  (f) 

lim  [i-^cos(^!>+^!.,-)]- +  [— ^sin(?^+(?i,-)]“  =  lim 


n  -*oo  n 


=  lim  |iE^iP+|-E».P=  lim 


s\n\n -K  f  i  /  f ,) 


n  — ►oo  n 


n 


sin(7rA) 

ttA 


”-*00  nhin^TTf  j  /  f ,) 
where  here  Ci  Acos{(j>i)  and  s,-  ^sin(</>,-),  by  straightforward  calculation.  Also, 


A  G(A) 


II  +  gg  =  lim  a  \/ G^+5'^cos((?!>-tan  ^(-^)) 

n  —►00  C 

where  C  [1/ n)Y^ci+Q  [\/ n)Y^Si  and  5  ^ -/(1/rz  +<5  (1/n  )^Cj  ,  from  which 

lim  (7^+5'^=  (/^+Q^)G(A)  follows  as  can  be  verified  by  substituting  for  the 

definitions.  We  finally  have  for  the  asymptotic  conditional  joint  density 


/  (/,g  \A,  exp 


(D.17) 

7^+g  ^d-nA  “G  [A)—2\/n' A  \/(/"+g  ")G  (A)cos(ijS— tan“''(-^) 

c 

-2V, 


Averaging  this  with  respect  to  (f)  on  [0,27r]  gives  the  same  expression  as  in  section  4.2, 
since  S  and  C  do  not  depend  on  (j),  and  where  the  effective  SNR  is  replaced  by 
nA^g(A)/2Fi.  Hence,  the  detection  probabilities  for  the  various  SSQME  tests  of  Sec¬ 
tion  5.2  are  identical  with  the  case  7^=0  if  the  effective  SNR  is  attenuated  by  g(A). 
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In  the  FFT  processor,  the  sidelobes  can  be  reduced,  and  the  response  can  be 
flattened  over  the  mainlobes  of  the  individual  filters  (  |  n/^//,.  |  <0.5),  at  a  price  of 
reduced  SNR  at  /,;=  0,  by  appropriate  weighting  (e.g.,  Hamming’s): 

{  x,y}— »-{w^x,w^y},  which  changes  G  {A)-*  \  —  u’t  exp(y  27rA:  A/n  )  |  *.  The  same 

^  /t=i 

can  be  done  with  the  SSQME  test.  The  derivation  of  the  test  statistic  is  similar  to  that 
above,  but  now  we  have  to  account  for  the  unequal  variances  as  a  consequence  of  the 
weighting.  The  frequency  response  will  be  given  by  G  {A)== 

I  — 27r^  A/n )  |  where  Wj^'A  The  design  of  the  weighting 

k  k 

coefficients  is  more  complicated  than  in  the  linear  FFT  case;  first  {w^. '  }  are  chosen  for 
the  desired  response,  and  then  /  /  ’{wk  x  )f  {x  )dx  =  w^'  has  to  be  solved  for  {w,,}. 
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Appendix  E 

The  Efficacy  of  Quadrature  Tests 

Pitman’s  efficacy  is  usually  employed  as  an  asymptotic  performance  measure  of  test 
statistics,  whose  asymptotic  distribution  is  Gaussian,  c.f.  [21],  [25],  However,  the  asymp¬ 
totic  distribution  of  the  quadrature  type  tests,  which  are  studied  in  this  thesis  for  detec¬ 
tion  of  coherent  fading  narrowband  signals,  is  non-Gaussian.  Hence,  an  appropriate 
definition  of  efficacy  is  needed. 

Let  Q„  =  T^(x]  +  T„\y),  where  the  statistics  T„{  )  are  asymptotically  Gaussian, 
under  both  the  hypothesis  and  a  location  shift  alternative;  /  =  /  o(2:,'  -  a),  with 

Eor„(-)  =  0,  VoT„{-)  =  a^-  (E.l) 

EiTn{-)  =  Vng{a),  Fj  r„  (•)  =  -b  O  (a  )  (E.2) 

Assume  that  (0)  =  0,  0  <  j  g'{0)  \  <  oo.  For  narrowband  slow  fading  signals, 

n 

/  (x,  y)  =  n  /  0(2:, •  -  aA  cos9,  y,-  -  aA  smd)  (E.3) 

1=1 

where  A  is  a  positive  r.v.  with  arbitrary  distribution,  and  9  is  uniform  r.v.  on  [0,27r]. 
Notice  that,  with  proper  normalization,  A  is  a  dimensionless  r.v.,  represents  the 
SNR,  and  the  influence  of  the  test  statistic  T„  (•)  is  gauged  by  g  (•)  and  a.  Define 

rrij.  A  E^Tn  (x)  =  -/n  g  (aA  c.os9),  my  A  E iT„  (y)  =  \fn  g  (aA  sin^)  (E.4) 

and  assume  also  that  (x)  and  T„  (y)  are  jointly  Gaussian  and 


lim  Ei{(r„(x)  -  m^){T„{y)  -  m  )}  =  0(a) 

n  — ►  OO 

i.e.,  r„(x)  and  (y)  are  asymptotically  uncorrelated  when  a  — 0. 
Definition  The  efficacy  of  the  test  is 


.  AM0}f 


lim  — 

n  — *  00  Tl 


dE iT„  / da  |  a  =0 


1  2 


(E.5) 


which  is  recognized  as  the  common  definition  of  the  efficacy  of  T„  . 

Justification  We  have  to  show  that  6q^  parametrizes  the  asymptotic  power  function. 
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For  that,  the  conditional  (on  A  and  0)  asymptotic  distribution  of  Q„  is  needed.  To 
avoid  singular  detection,  consider  a  sequence  of  alternatives  a„=Sn~^^^.  By  virtue  of 
our  assumptions  on  the  asymptotic  distributions  of  (x)  and  T„  (y),  their  asymptotic 
distribution  is  bivariate  Gaussian  with  zero  correlation.  Therefore,  the  asymptotic  con¬ 
ditional  p.d.f.  is  non-central  Chi-squared  with  two  degrees  of  freedom  and  non-centrality 
parameter  X  =  -p  m/,  c.f.  [51,  pp.  113-118], 

/  .  M  ,«)  A  to  1  exp  (- I 
n  -  00  2c7^  [  2cr~  } 

where 

h{a)  =  g\aA  cosO)  -f-  g\aA  sm9)  (E.7) 

Under  Hq,  Eq.  (E.6)  reduces  to  a  one-sided  exponential,  thus  a= 

^o{Qn  >  t  }=exp(-t /2(7-).  Under  77 1, 

00 

^  ^  ^  =  E^Eg  f  f  loo  (Qn  I  A  ,0)dQ„  (E.8) 

-ZcT^loga 

where  interchanging  of  the  limit  and  integrations  is  justified  by  the  dominant  conver¬ 
gence  theorem.  Expanding  Eq.(E.7)  in  a  Taylor  series  and  utilizing 
ff(0)=0,  U'(0)|  <  oo,  the  first  two  terms  vanish  and  we  obtain 

^  +  O  (l/\Ar )  (E.9) 

which  is  not  a  function  of  9,  to  order  Thus,  with  the  definition  of  Marcum’s  Q- 

function,  the  limiting  power  is 

/3  =  E^Q[6A  I  ^  '(0)  I  ja,  \/~2log  a]  (E.IO) 

For  a  given  distribution  of  A  ,  and  a  given  input  SNR  (d),  the  detectability  is  thus 
governed  by  the  parameter  j  ^  ^(0)  j  /<t,  and  the  definition  is  justified.  (Recall,  however, 
that  the  random  phase  and  amplitude  incur  detectability  loss  in  terms  of  the  required 
input  SNR,  c.f.  [13],  [51].) 


VQn  nh[a^) 


(E.6) 
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Appendix  F 

Efficacy  Optimal  Tests  for  “Single  Sweep”  Ranking 

Let  {x,o},=i  be  the  observables  which  contain  the  information  on  the  alternative 
K,  and  },•"=!,  A:  =  1,...,M  be  noise  reference  observations.  Under  /v,  a  location 
shift  model  is  assumed 

n  M 

f  (xo,  Xj,  x^^)  =  n  /  (Jio  -A)Uf  {Xik  )  (F.l) 

1—1  k—l 

In  radar  terminology,  during  each  “sweep”  i  (the  repetition  interval  of  the 
transmitted  pulse  train),  M  reference  observations  are  taken  in  a  region  around  the  pos¬ 
sible  signal  location. 

A  “single  sweep”  rank  statistic  is  defined  as 

n 

'Tn  =  Y!i  ^M+l{^M  +  l,i)  (F.2) 

t=l 

where  RM+i,i  is  the  rank  of  x.-q  in  {x,-,(.  in  the  following,  the  M +1  subscript  will 

be  dropped  for  convenience. 

As  T„  is  the  sum  of  i.i.d.  random  variables,  it  is  asymptotically  Gaussian  with 

E{T„}  =  nE{a{R)},  Var{T^}  =  nVar{a{R)}  (F.3) 

The  discrete  probability  distributions  of  the  ranks  is  given  by 

Piin  J  Pr  {Pi  ==  I  e  [1,...,M+1]  \K}  =  P,  {x,(,_i)  <  x,o  <  Xi(,)}  (F.4) 

00 

=  (/^l)  /  f{x-A)F‘-\x)[l-Fix)f-^^-‘dx 

-00 

and 

Pq{1)  =  Pr  {Pi  —  ^  i  ~  +  1)  (^•^) 

Thus, 

M-fl  -  M-hl 

E{a{R)\K}==  Y:  ciinPiin.  Eo{a)  =  -^  Y  « (0 

M+1 


(F.6) 
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and 


dE{a{R)\K\ 


M+l 


dA  ix_o"i7Tr  ,5, 


where 


(F.7) 


a{lj)  j  f  '{x)F^-\x)[l-F[x)] 


M  +  l-l 


dx 


Also, 


M-hl 


A/+1 


(F.8) 


Var{a{R)\H}^  E  ^o(0[a  (0-^o(a  )P  =  E  [«  (0-^o(«  )f  (F.9) 

and  it  can  be  easily  verified  that 

hm^Var{a{R)\K}=  Var{a{R)\H} 

Pitman  s  conditions  [25]  are  satisfied,  and  the  efficacy  is  given  by 


e  = 


1 

dF{a(R)lK}/dA 

'  M+l 

-A==0-  . .  . 

Var{a{R)\H} 


M+l  T2 

E  (a  (/)-^o(a  ))«(/,/) 

l=\ 


M  +  l 

E(a(/)-^o(«))' 

i  =  i 


-(F.IO) 


M+l 


where  we  have  used  E  ‘^(^)/  )  =  0,  as  can  be  shown  from  its  definition,  Eq.  (F.8). 

Eq.  (F.IO)  can  be  used  to  measure  the  efficacy  of  an  arbitrary  score  function  «(•)  at  a 

given  noise  p.d.f.  By  the  Cauchy-Schwarz  inequality,  6  is  maximized  by 
a  (f )  -  £'o(a  )  ==  a  (/ ,/  )  =>  a  (/ )  —  a  (/ ,/  ),  and 


1  M+l 

max  €  =  — -  E  ,f  ) 

{«(()}  M+l  ^  > 


(F.ll) 


when  the  signal  is  time  varying,  A  s,-A  in  (F.l).  Due  to  the  sweep-to-sweep  indepen¬ 
dence  of  the  Ri ,  the  influence  of  the  s,  on  6  appears  as  a  multiplying  factor.  Following 
the  same  steps,  it  is  readily  shown  that  the  optimal  test  is 


E  ®M +l('^M +l,t  ) 

J  =1 


(F.12) 
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and  Eq.  (F.ll)  should  be  multiplied  by  ^  s,-^. 

»=i 

For  large  M,  the  optimal  scores  can  be  replaced  by  the  approximation  [49] 
a  where  <t>[u  J  )  = -f '{F-\u))/ J  {F-\u)),  and  then 

max  e  I{f  ).  Hence,  when  M  is  large,  the  “single  sweep”  ranking  structure  does  not 
entail  any  losses. 

It  is  easy  to  show  that  T„  of  Eq.  (F.2)  with  the  optimal  scores  a{l,f)  is  also 

n 

locally  optimal  for  finite  n  ,  since  P i'{Ri)/P o{Ri). 

i  =1 

The  a{l  ,f  )  are  also  the  optimal  scores  of  the  corresponding  quadrature  test 

n  n 

[ZI  +  (Zl  since  its  efficacy  is  identical  with  (F.IO),  see  Appendix  E. 

1  1 


Appendix  G 

The  ARE  of  two-sample  and  single-sweep  rank  tests 
with  Wilcoxon  scores 


From  Eq.(F.lO)  of  App.F,  and  from  Eq.(6.3.12) 


c2 


[{  J'{F(^))f’ix)d2 


ARE^s  ss  =  (1 — — )•  — 


N 

E«-(0 

i=l 


fj^{u)d2 

0 


AE«(0«(i,/  )f 


(G.l) 


where  A=M+1,  a{i,f  )  is  given  by  Eq.(F.8),  and  the  indices  2S  and  SS  stand  for  two- 
sample  and  single-sweep  test,  respectively. 

With  Wilcoxon  scores,  J{u)=u-l/2,  and  a  (f  )=27(yV+1)-1/2.  The  ratio  of 

integrals  in  (G.l)  is  easily  computed  to  be  12[J  f -{x)dx]-.  Also,  S  «"(0= 

i  =1 


(Ar-1)A^/(12(A^+1)).  For  the  remaining  term, 

N  .  N 


E( 


1 


1=1  A/^+l  2 


1 


N+1 


E  *«(g/  )= 


N 


N+1  ,7^ 


E  >'  (7//  '(*  )!"■'  dx 


N 


N-1 


E  (1+-^  iV)if  )[i-^ 


N+l  k=o 


Writing  this  as  the  sum  of  two  terms  by  breaking  the  (l-l-Z:),  the  term  corresponding  to 
the  1  vanishes  after  changing  the  order  of  summation  and  integration  and  utilizing  the 
binomial  expansion, 

^-1  /  yV-l  \ 

E  (  ^  =  1 

k=0 

00 

since  J  f  '{x)dx=0.  For  the  term  corresponding  to  k,  the  ^"==0  term  in  the  summa- 
-00 

tion  vanishes,  thus  another  change  of  variables  yields 
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Interchanging  the  order  of  summation  and  integration  and  using  the  binomial  expansion 
again,  and  integrating  by  parts,  yields 


A 


N(N-l) 

N+1 


00 

I  f  \^)dx 

-00 


Combining  all  the  above  derivations,  (G.l)  reduces  to 


ARE  25,55 


which  is  independent  of  /  (•). 


M+l  ' 


(G.2) 
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Appendix  H 


Efficacy  and  Integration  Gain  for  Non-coherent  Stochastic  Signals 

Let  the  available  observations  under  the  alternative  be  the  in-phase  and  quadrature 
components  of  a  narrowband  process  {a;,- ,  The  densities  are  given  by 


and  /  o(x,y)  —  /  i(x,y;0).  Here  /  (■)  is  the  joint  density  of  the  narrowband  noise, 
Fg  (  )  is  the  joint  distribution  of  the  additive  stochastic  signal,  and  the  parameter  a,  is 
related  to  the  SNR.  The  following  assumptions  are  made:  1)  all  first  and  second  deriva¬ 
tives  of  /  (•)  exist;  df  Idx^dy^  is  denoted  and  so  on.  2)  {s, ,  s,  }  is  a  wide  sense 

stationary  process,  with  Es^,-  =  =  0.  The  absence  of  a  common  mean  can  actually 

be  viewed  as  a  definition  of  a  non-coherent  stochastic  signal.  In  particular,  it  is  true 
when  5j,-  =  A,- cos^,- ,  Sg,-  =  y4,  sin0,-,  and  the  phases  0,-  are  i.i.d. 

Consider  a  memoryless  test  statistic 


n 


^  =  S  gi^i,  Vi ) 


Under  rather  mild  regularity  conditions,  T  is  asymptotically  normal,  and  the  asymptotic 


power  IS 


^  =  lim  (f) 


a  — ►  0 


E^T-EqT  V^I'^T 
V^I^T  V^l-T 


=  <t>[d 


for  some  5>0,  where 


d  =  lim 


E^T  -EqT 


Expanding  the  numerator  in  a  Taylor  series 


but 
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dE^T 

da 


^-E{ 


n 


S  (^ci  /  +'^51  I  ) 

t==rl 


n 


E  9[^j.  Vj) 

y=i 


}  =  o 


where  the  expectation  is  taken  over  all  the  random  variables.  Thus,  unlike  the  coherent 
signal  cases,  the  first  term  in  d  is  proportional  to  rather  than  to  a,  under  the  most 
general  assumptions.  This  also  implies  that  a  non-trivial  definition  of  a  locally  optimal 
detector  for  non-coherent  signals,  is  the  one  that  maximizes  the  second  derivative  of  the 
power  function  at  zero. 

To  get  a  simple  expression  for  the  efficacy,  we  assume  now  that  the  noise  samples 
are  i.i.d.,  and  that  the  signal  is  uncorrelated  and  its  spectrum  is  symmetric  around  the 
carrier  frequency.  These,  together  with  the  w.s.s.  assumption,  amount  to  (ci.  [14],  [5l]) 

Thus,  it  is  readily  found  that 


Hence, 


d^E^T 
da  ^ 


VoT  =nVog,EoT  nE,g 
”  I  I  9{x  ,y)[f  xx{^  ,y)  +  f  ,y)]dxdy  A  nl{g  J  ) 


,  ^  ^/na‘^  I{g  ,/  ) 
2  Vo^/^g 


By  way  of  comparison,  for  coherent  signals  d  is  proportional  to  a  ,  and  is  larger  for  weak 
signals.  If  the  narrowband  noise  is  normalized  to  unity  variance,  a"/2  =  SNR  .  The 
integration  gain  for  two  detectors  with  the  same  g  (•),  is  defined  as  the  limit  of  the  ratio 
of  the  input  SNRs,  when  both  operate  at  the  same  level.  Thus, 


ai  —  =  ot 


=  /^2  7^ 


By  way  of  comparison,  the  integration  gain  of  detectors  for  coherent  signals  is 
(ni/rio),  therefore  coherent  signals  lead  to  better  detection  performance. 
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Appendix  I 

The  efficacy  and  asymptotic  distribution  of  rank  Doppler  tests 

I.l)  One  -  and  two  -  sample  rank  Doppler  tests 

With  the  definitions  of  Chap.  6.6.1,  the  means  of  the  observations  after  the 
“butterfly”  transformations,  Eq.(6.6.2),  are 

di^EiIi=aAsiCos6i[l-U{i-n-l)]  ,  5,-  ,^27r2A/n+6'  (I.l) 

di^AEiQi  =ayls,-sin5,-  [1-17 (f-n-1)]  (I.2) 

Throughout  this  appendix,  all  expectations  and  variances  under  are  conditional 

expectations,  given  A  and  9.  To  compute  the  efficacy,  the  asymptotic  means  and  vari¬ 
ances  of  /  and  Q  ,  defined  in  (6.6.4),  are  needed.  They  can  be  found  from  Theorem  6.2.9, 
provided  the  conditions  (6.2.23)  and  (6.2.24)  are  satisfied.  For  the  /  channel,  from  (I.l), 

=  Y:{d/-d7f  =  na^A\C^-jL.)  < 

i=i  M-t-1  - 

where  C  A{l/n  )  X]  c,- ,  c,-^  Ci  Asi  cos(5,- .  Thus, 

i=l 

naU2^<  a^A^  f]  =  na^A-E^  <  00 

t=l 

Hence,  for  na‘^-y  constant  and  ^<00,  lim  <co  and  (6.2.23)  is  satisfied.  The  condi- 

n  —►00 

tion  (6.2.24)  can  be  written  with  the  above  definitions  as 

B  n|^-CV(Af+l)| 

max{(c,--C'/(M+l))2  ,  (^/(M+l))^}  ^  ‘ 

n 

For  an  absolutely  summable  signal  sequence,  {l/n)Y^  \  s;  |  <co,  all  the  expressions  in 

i  ==1 

(1.3)  are  finite,  thus  lim5„=oo  as  required.  The  conditions  for  the  Q  channel  are 

n  “+00 

satisfied  in  a  similar  manner. 

Invoking  Th.  6.2.9,  we  have 

l‘l  =  Ell  =  -^aAJ](s,—A^)s,casSi  G'(J,  f  ) 


(1.4) 
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fXQ  ^  E,Q  =  G'{J,  f  ) 

where  G'{J,  f  )  is  the  integral  in  (6.2.27).  From  (6.2.28)  and  (6.3.14), 


(1.5) 


Vol  =  VoQ  = 


1— 


52 


M+1 


f  J^(u  )dv 


(1.6) 


As  in  App.  D,  the  derivation  following  Eq.(D.7),  6  cancels  when  summing  Ai/'+Mg.  Hence, 
the  efficacy 


€  r  =  lim 

”  n  —>-00 


(d  / da  )^+(d  Hq  j da  )^ 


VJ 


is  found  when  substituting  (6.6.5)  into  (6.3.11) 


(1-7: 


1.2)  Single  -  sweep  rank  Doppler  test 

T„  {k  )  of  Eq.  (6.6.11)  can  be  written  as  T„  {k  )=I^+Q^^  where  here 

7  ! —  S  T)  , —  2  (u,-  c,-  +u,'  5,- )  ,  Q  =  —  ^  2  (— Sj  +u,-  c,-  )(1.8) 

Vn  ,-=1  Vn  ,-=i  Vn  ,-=1  vn  i  =1  »  wv  ; 

{uj,  u,-}  are  the  scored  single  -  sweep  ranks  defined  in  (6.6.10),  and  here 

c,-  ^  COSO;,-  ,  s,-  ^  sina,-  ,  o;,-  ^  27rf/[:  /n  (1.9) 

From  the  original  assumptions,  {C/,}  and  {V^}  are  sequences  of  independent  r.v.,  and 

Ui  is  independent  of  Vj.  Though  {/,}  and  [Qi]  are  not  identically  distributed, 

Liapounoff’s  version  of  the  central  limit  theorem  is  valid  for  I  and  Q  ,  since  the  third 

centered  absolute  moments  of  /,■  and  Qi  are  bounded,  as  they  are  discrete  and  bounded 

r.v.  Paralleling  App.  D,  to  prove  (6.6.12)  we  have  to  show  the  following; 

E,\I)  ,  EAQ) 


a)  lim 


+ 


Hy-^HoVar^{I)  Vari{Q) 

bl  lim  +  <3^l(g) 

H,-^H,Var^{I)  Vor,(g) 

,  ,  E,(IQ)-EiI)E,{Q) 

C)  iim  -  ,  — 

\/yar  i(/)  Var  ^{Q  ) 


G(A)  =  [sin(7rA)/7rA]= 


V I^+Q‘^cos{0  -  tan~^6(/ ,  Q  )) 


Here,  lim  is  a  shortened  notation  for  the  limit  (n  — voo,  a  — >-0,  na~<co),  ~  denotes 
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proportional  to,  and  6[I ,  Q)  is  some  function  R^—*R  \ 

Throughout,  we  assume  the  score  function  a^+iCO  is  skew  -  symmetric,  thus 
^o{Ui)—^o{Vi)=EQ(I)=EQ{Q)=^0.  The  observations  model  is  of  Eq.(6.3.1),  where 
=2m  {k  +A)/n  +9,  see  (6.6.1). 

a)  Expanding  in  a  Taylor  series  for  g  {a  )=E i{I)  yields 


Ei[I)  —  fi'^(0)+2og  (0)^  (0)+a^((7^(0)+^  (0)£r'(0))+O  (a^)  (I  lO) 


but. 


where 


■Pli(0  —  P  l{RlM+i)i=^}  =  (i_Jf  f  {x-aA  cosl3i)F‘  ^(2;  )[l-F  (x  da:  (1.12) 

Pfiin  =  Pi{P(M+i)i=^}  =  (x-aAsin/3i)E‘-\x)ll-F(x)f^'^^-' dx  (1.13) 


Thus, 


)  _  Aa  (/ ,/ )cos/3j* 

dd  tV/  -j-  1 


where  a(/,  /  )  is  given  by  Eq.(F.8)  of  App.  F  and  is  assumed  finite  (sufficient  condition 
is  /  to  be  strongly  unimodal).  Differentiating  (I.ll)  and  substituting  (1.14)  yields 


In  a  similar  manner, 


dE,{Q) 


(1.16) 


da 


Combining  (1. 10)  with  (1.15  -  1.16)  and  j3i-ai=6i  of  (I.l), 


Notice  that  6  has  canceled.  Due  to  the  i.i.d.  assumption  and  from  the  mean  value 
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theorem, 


Vari(/)  ,(/,)-  K«r„(/)+  if; 


^i=l 


i=l 


<  Faro(/)  +  a  max{ 


da 

dVar,{Ii) 

da 


(1.18) 


a '  <  a 


a'  < a 


} 


Hence,  a  sufficient  condition  for  lim  yari(/)  =  VarQ{I),  is  lim  dVar  ^{lA/da  <oo,  \f  i . 

a  —*0  a  — >0 


Since 


and 


Var  [li  )=Ci  ^  Var  ( [/,■  )+s,-  ^  Var  ( V,-  )=  Var  ( C/,-  )=  Var  ( V,-  )=  Har  ( Q,- ) 


M+1 

rar, ([/,■)=  E«V)^W)-  S  (i.io) 

(=1  /  =  ii=i 

with  a  similar  expression  for  Hari(V,-),  it  is  seen  from  (1.14)  that  the  derivatives  of 
I^ari(/,)  and  Vari(Qi)  are  proportional  to  a(l,f  ),  hence  bounded,  thus 


-I  A/ +1 

Vim  Var^il)  =  lim  Far i(g)  =  Eo\Vi)  =  Eo\Ui)  =  __  (0  (1-20) 


a— 0  a— 0 

Combining  (1.17)  and  (1.20) 


^iV)  ^  E,\Q)  na^A^G^{f)G{A) 

. .  EMi) 


Var  i(/ )  Var  i{Q) 


(1.21) 


b)  From  (1.15  -  1.16),  the  first  term  in  the  expansion  is 

IE,(I)+QE,{Q)  . 

aAG{f)/\/n  ,=i  ,=i 

=  cos8[IY  COS7,-  +<52  sin7i  ]  +  sme{-lY  sin7i  +<3  E  ^037;  ]  = 


i=l 


i  =1 


i=l 


i=l 


=  V (7^+5^cos(0  -  tan 

0 

where  C  is  the  term  multiplying  cos0,  S  is  the  term  multiplying  sin0,  and  '^i=2'Ki  n  . 
It  is  easy  to  show 


C'24-5'2=(/2+g2) 


E  exp(y7,-) 

i  =  l 


(/-+g2).;,2c.(^) 


Thus, 
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Var^{I) 


+ 


^  ^aG(f 

Var^{Q)  Eo‘{Ui) 


cos{6  -  tan 


(1.22) 


c)  We  neeed  to  show  that  asymptotically  I  and  Q  are  uncorrelated.  Starting  from  (1.8) 
and  invoking  the  appropriate  independence  assumptions,  it  can  be  shown 


E,{IQ  )-E,{I)E,{Q)=  cj{E,iVi  Fy )  -  E ,{Vi)E ,{Vj))  (1.23) 

”  ij 

-  TS'O/CS.CC'i  Uj)-  E,(Ui)E,{Ui))  =  i-  f;  s,c,{Var,{Vi)  -  Var 


"1.1 


i=l 


Expanding, 


S  5,-  c,  ( Far  o(  V)- )-  Far  o(  Ui ))  +  ^  E 


d(Faro(F,)-Faro(17,)) 

da 


0<a'  <a 


<  a  max 
i 


d(Faro(F,)-Faro(f/,)) 

da 


0<a'  <a 


0(a) 


Thus,  as  the  variances  are  0(1), 

E,{IQ)-E,{nE,{Q)  ^ 

^/Vary{I)Vary[Q) 

Notice  that  cancellation  of  the  zeroth  order  of  the  covariance  is  due  to  the  rotation 

transformation,  which  results  in  (1.23).  In  fact,  it  can  be  shown  that  the  derivative  of  the 

M  +  l 

difference  is  proportional  to  E  «Af+i(Oa(^  /  )•  Thus,  with  the  additional  assumption 

l=i 

of  symmetric  /  this  term  also  vanishes,  and  the  correlation  coefficient  is  even  O  (a^) 
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Appendix  J 

Monotonicity  of  the  error  probability  in  digital  noncoherent 
communication  with  respect  to  the  SNR  ^ 


In  Chap.  8,  it  was  shown  that  the  generalized  SNR  P  [I ,  f  )  possess  a  minimax  pro¬ 
perty,  in  the  particular  class  of  AWNGN  processes  and  in  the  class  of  decision  rules 
based  on  either  limiter-correlators  or  on  generalized  M-estimators.  To  show  a  minimax 
property  on  the  error  probability,  it  suffices  to  show  that  the  error  probability  of  the 
optimal  M-ary  decision  rules  for  the  AWGN  channel  is  a  monotonic  function  of  the 
SNR.  While  this  seems  obvious  and  is  evident  from  all  the  graphic  exhibits  in  the  litera¬ 
ture,  we  have  not  find  any  general  proof  for  arbitrary  signaling  schemes  and  noncoherent 
reception.  In  the  following,  we  demonstrate  it  for  the  important  case  of  orthogonal  sig¬ 
nals.  We  denote  by  X=€/Nq  the  SNR. 

The  probability  of  correct  decision  is  given  by,  cf.  [65], 


(J.l) 


Differentiating,  we  have  to  show  the  following  for  all  X  >0  and  M>2 


(J.2) 


Expanding  in  a  Taylor  series  at  A’=0,  it  suffices  to  show  that  the  coefficients  af,  are 
positive  for  all  k>0,  where 


(J.3) 


Define  a  generating  function  Pk{t)  to  be  given  by  the  same  expression  for  a/.,  but 
the  summands  are  multiplied  by  G  ;  thus  (0)=0  and  P,j.  (!)=«*•  ^Ve  will  prove  that 
Pk{t)  is  positive  for  all  t  >0,  by  mathematical  induction  on  A; .  a)  Assume  Pj._i(i)  is 

^  The  initial  part  of  the  proof  was  suggested  by  Avner  Dor. 
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positive,  than  it  is  easy  to  see  that 

P,{t)  =  j^Udlldy  (J,4) 

and  is  positive  by  the  assumption,  b)  To  show  that  P^t)  is  positive,  let 
P  )=9  {i)  -  h{t),  where 

M  ^ 

9{t)=  =  (1-0^-  1  =  Mj{l-y)^^-^dy  (J.5) 

0 

(the  middle  equality  follows  from  the  binomial  expansion),  and 

'■(<)=  (J.6) 

i=l  * 

It  ie  easy  to  see  that  th{t)  =  (1-/  )^-  1,  thus,  as  h  (0)=0, 

A  (0  ~  j  (J,7) 

0  y 

Combining  with  (J.5), 

f  o(0  =  ,  (0  -  A  (0  =  j. (J,8) 

0  2/ 

It  can  be  seen  that  the  numerator  of  the  integrand  is  positive  for  0<y  <t  <1,  thus 
P o{t)  is  clearly  positive  and  the  proof  is  completed. 
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